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ABSTRACT 

Using the conditional luminosity function (CLF) — the luminosity distribution of 
galaxies in a dark matter halo — as the fundamental building block, we present an 
empirical model for the galaxy distribution. The model predictions are compared with 
the published luminosity function and clustering statistics from Sloan Digital Sky Sur- 
vey (SDSS) at low redshifts, and galaxy correlation functions from COMBO-17 survey 
at a redshift of 0.6, DEEP2 survey at a redshift of unity, Great Observatories Deep 
Origins Survey (GOODS) at a redshift around 3, and Subaru/XMM- Newton Deep 
Field data at a redshift of 4. The comparison with statistical measurements allows 
us to constrain certain parameters related to analytical descriptions on the relation 
between a dark matter halo and its central galaxy luminosity, its satellite galaxy lu- 
minosity, and the fraction of early- and late-type galaxies of that halo. With the SDSS 
r-band LF at M r < —17, the log-normal scatter in the central galaxy luminosity at 
a given halo mass in the central galaxy-halo mass, L C (M), relation is constrained to 
be 0.171q'q^, with 1 a errors here and below. For the same galaxy sample, we find no 
evidence for a low-mass cut off in the appearance of a single central galaxy in dark 
matter halos, with the 68% confidence level upper limit on the minimum mass of dark 
matter halos to host a central galaxy, with luminosity M r < —17, is 2 x 10 10 h^ 1 
Mq. On the other hand, the appearance of satellites with luminosities M r < —17 at 
z < 0.1, using a total luminosity-halo mass relation of the form L C (M)(M/M sa t)j, 
is constrained with SDSS to be at a halo mass of M sat = (1.2j^) x 10 13 /i -1 Mq 
with a power-law slope (3 S of (0.56^;^). At z ~ 0.6, COMBO-17 data allows these 
parameters for Mb < —18 galaxies to be constrained as (3.3+g'g) x 10 13 h~ x Mq and 
(O.62IQ 27), respectively. At z ~ 4, Subaru measurements constrain these parameters 
for M B < -18.5 galaxies as (4.12+jj;^) x 10 12 hr 1 Mq and (0.55±g;|§), respectively. 
The redshift evolution associated with these parameters can be described as a combi- 
nation of the evolution associated with the halo mass function and the luminosity-halo 
mass relation. The single parameter well constrained by clustering measurements is 
the average of total satellite galaxy luminosity corresponding to the dark matter halo 
distribution probed by the galaxy sample. For SDSS, (L sat ) = (2.1+°;!) x 10l ° h ~ 2 
L Q , while for GOODS at z ~ 3, (£ sat ) < 2 x 10 11 h~ 2 L Q . For SDSS, the fraction 
of galaxies that appear as satellites is 0.13lg;g|, O.llj^, O.ll+^JI, and 0.12+.°; 33 
for galaxies with luminosities in the r'-band between -22 to -21, -21 to -20, -20 to 
-19, and -19 to -18, respectively. In addition to constraints on central and satellite 
CLFs, we also determine model parameters of the analytical relations that describe 
the fraction of early- and late-type galaxies in dark matter halos. We use our CLFs to 
establish probability distribution of halo mass in which galaxies of a given luminosity 
could be found either at halo centers or as satellites. Finally, to help establish further 
properties of the galaxy distribution, we propose the measurement of cross-clustering 
between galaxies divided into two distinctly different luminosity bins. Our analysis 
show how CLFs provide a stronger foundation to built up analytical models of the 
galaxy distribution when compared to models based on the halo occupation number 
alone. 
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1 INTRODUCTION 

The conditional luminosity function (CLF; Yang et al. 
2003b, 2005), or the luminosity and color distribution of 
galaxies within a dark matter halo of mass M, <f>(L, c|M), 
captures important astrophysical information that deter- 
mines how the large scale structure galaxy distribution 
is related to that of the dark matter. As shown in 
Cooray & Milosavljevic (2005b; Cooray 2005a), a simple 
empirical model for the CLF, when combined with the 
halo mass function, describes the galaxy luminosity func- 
tion (LF) accurately; This empirical model recovers the 
Schechter (1976) form of the galaxy LF given by ${L) oc 
(L/L*) a exp(— L/L*) with a characteristic luminosity 
and a power-law slope at the faint end of a. A basic ingredi- 
ent in this model is the relation between central galaxy lumi- 
nosity and the mass of the halo in which the galaxy is found 
— the L C (M) relation of Cooray & Milosavljevic (2005a). 
The characteristic luminosity of the Schechter function is 
the luminosity when the scatter in the L C (M) relation, at 
a given halo mass, dominates over the increase in the lumi- 
nosity with mass or when d\n L c /d\n M ~ ln(10)cr where 
a is the log-normal dispersion in the L C (M) relation. Given 
the observed dispersion, we find M* k 2 x 10 13 Mq and 
L-t,, corresponding to L c (M ir ), agrees with estimates from 
observations. The faint-end power-law slope of the LF is a 
combination of the power-law slope of the L C (M) relation at 
M <C Mi, and the slope of the dark matter halo mass func- 
tion. The combination puts a strict bound that a < —1.25, 
consistent with observations that indicate a « —1.3 (Blan- 
ton et al. 2004; Huang et al. 2003). 

The L C (M) relation, as appropriate for galaxies at low- 
redshifts and in the K-band, was established in Cooray & 
Milosavljevic (2005a) from a combination of weak galaxy- 
galaxy lensing data (e.g., Sheldon et al. 2004; Yang et al. 
2003a) and direct measurements of galaxy luminosity and 
mass in groups and clusters (e.g., Lin et al. 2004; Lin & 
Mohr 2004). The same relation, as appropriate for lower 
wavelengths, has been established with a statistical analy- 
sis of the 2dF Galaxy Redshift Survey (2dFGRS; Colless et 
al. 2001) fej-band LF (e.g., Norberg et al. 2002) by Vale & 
Ostriker (2004) and, independently, by Yang et al. (2005) 
based on the 2dFGRS galaxy group catalog. The shape of 
the L C (M) relation, where luminosities grow rapidly with 
increasing mass but flattens at a mass scale around ~ 10 13 
Mq is best explained through dissipationless merging his- 
tory of central galaxies (Cooray & Milosavljevic 2005a). A 
large fraction of these bright galaxies, in centers of groups 
and clusters, are early-type and observational evidences for 
dry mergers as a dominant process in the formation and 
evolution of massive, luminous early-type galaxies are now 
beginning to appear (Bell et al. 2005). In Cooray (2005b), 
the approach based on CLFs was extended to higher red- 
shifts and a comparison with galaxy LFs observed out to 
redshifts 2 and higher, with surveys such as DEEP2 (Davis 
et al. 2003; Willmer et al. 2005; Faber et all. 2005) and 
COMBO-17 (Wolf et al. 2001, 2003; Bell et al. 2004), al- 
lowed constraints on the redshift evolution of the L C (M) 
relation. 

Beyond the total galaxy LF, the empirical modeling ap- 



proach based on CLFs can easily be extended to consider 
statistics of galaxy types as well. For example, in Cooray 
(2005a), we studied the environmental dependence of galaxy 
colors, broadly divided into blue- and red-galaxies given the 
bimodal nature of the color distribution (e.g., Baldry et al. 
2004; Balogh et al. 2004). There, we described early- and 
late-type conditional LFs measured with 2dFGRS as a func- 
tion of the galaxy overdensity (Croton et al. 2004) based on 
an empirical description of the fraction of late- and early- 
type galaxies relative to the total number of galaxies in dark 
matter halos as a function of the halo mass. With an in- 
creasing fraction of early-type galaxies as the halo mass is 
increased, the simple analytical model considered in Cooray 
(2005a) explains why the LF of galaxies in dense environ- 
ments is dominated by red galaxies. 

While LFs are well produced by this analytical model, 
with statistics related to early- and late-type galaxies pa- 
rameterized by analytical functions, it is timely to consider 
how this model compares with higher order statistics of the 
galaxy distribution such as those related to the clustering 
pattern of galaxies. A comparison to data could also pro- 
vide additional constraints on ingredients related to this 
model and especially those related to CLFs of central and 
satellite galaxies. While numerous predictions and limited 
comparisons to data exist in the literature on how well clus- 
tering measurements can constrain galaxy properties, these 
are mostly considered in terms of the simple halo occupa- 
tion model involving the number of galaxies in a dark matter 
halo as a function of the halo mass, N g (M) (e.g., Seljak 2000; 
Scoccimarro et al. 2001; Berlind & Weinberg 2002; Cooray 
2002; Berlind et al. 2003; Zehavi et al. 2004; Zheng et al. 

2004) . 

The same approach of constraining the halo occupation 
number based on galaxy clustering data has been applied 
at redshifts ~ 0.6 with COMBO-17 (Phleps et al. 2005) 
and, more frequently, at ~ 3 and higher using Lyman-break 
galaxies (LBGs) and similar drop-out samples (e.g., Bullock 
et al. 2002; Lee et al. 2005). Since the halo occupation num- 
ber is an integral function and treats all galaxies the same, 
regardless of the color or the galaxy luminosity, meaningful 
models that account for differences in galaxy physical pa- 
rameters cannot easily be considered. It is also no surprise 
that the halo model, based on the halo occupation number 
alone, cannot be used to model the LF of galaxies. Even with 
clustering statistics, due to differences involving luminosities 
of galaxies in different samples and potential variations with 
redshift when defining galaxy samples, constraints on the 
simple halo occupation number at different redshifts cannot 
be compared with each other easily. 

The best approach to overcome these drawbacks is to 
make use of the conditional occupation number or, more 
appropriately, CLF as the fundamental quantity when de- 
scribing galaxy statistics (Yang et al. 2003b, 2005). The 
CLFs extend the analytical halo model (see, Cooray & 
Sheth 2002 for a review) by dividing the mean number of 
galaxies to a distribution in galaxy luminosity such that 
$(L|M) = dN g (M)/dL and using $(L|M) to model ob- 
served statistics rather than N g (M) (Yang et al. 2003b, 

2005) . Similarly, the same approach can be extended for 
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galaxy color or any other property as one can easily define 
the subsample of galaxies related to that property, but with 
the restriction that the whole sample be contained within 
the total LF. Thus, the approach based on CLFs is useful 
when comparing with measurements conditioned in terms of 
galaxy properties such as the luminosity or the color. With 
wide-field surveys, where statistics of a few hundred thou- 
sand galaxies or more are easily available, the division of 
measurable statistics to galaxy properties is now common. 
Given the existence of measurements already, for example 
clustering properties as a function of the galaxy luminosity 
(Zehavi et al. 2004) or galaxy-mass cross-correlation through 
galaxy-galaxy lensing studies as a function of the galaxy lu- 
minosity (Mandelbaum et al. 2004; Sheldon et al. 2004), the 
need for an improved halo model is clear. 

Here, we extend our previous discussions related to 
CLFs (Cooray 2005a, 2005b), where we modeled the LF, to 
study galaxy clustering and make predictions for clustering 
statistics at the two-point level involving projected correla- 
tion functions as a function of the redshift. These models 
are then compared with existing results from surveys such 
as Sloan Digital Sky Survey (SDSS; York et al, 2000; Zehavi 
ct al. 2004) at redshifts less than 0.1, COMBO-17 survey 
at redshifts between 0.4 and 0.8 (Wolf et al. 2001, 2003; 
Phleps et al. 2005), DEEP 2 survey (Davis et al. 2003; Coil 
et al. 2004) at a redshift around unity, Great Observatories 
Deep Origins Survey (GOODS; Lee et al. 2005) at a redshift 
around 3, and Subaru/XMM-Newton Deep Field LBG clus- 
tering at a redshift of 4 (Ouchi et al. 2005) to derive general 
constraints on the underlying CLF. 

A previous attempt related to extracting properties of 
the galaxy sample in 2dFGRS through CLFs is described in 
Yang et al. (2005). In this analysis, authors made use of a 
priori assumed Schechter function shape for the CLF (Yang 
et al. 2003b) , though we make no such assumptions here. In 
fact, galaxy cluster or group LF data suggest that Schechter 
function shapes are not the appropriate form to describe 
their luminosity distribution, given the central galaxy. A 
combination of a log-normal component and a power-law fits 
the data best, consistent with the CLF models we have sug- 
gested. Motivated by Yang et al. (2003b), Yan et al. (2003) 
used the same CLF description involving Schechter func- 
tion shapes to compare galaxy clustering between 2dFGRS 
and DEEP2 and suggested that the CLF does not strongly 
evolve with redshift. 

While we make use of SDSS clustering measurements 
in our analysis here, we also note that an attempt has been 
made to establish $(Z/|M) based on differences in halo occu- 
pation models, as a function of luminosity, when describing 
clustering statistics as a function of the galaxy luminosity 
(Zehavi et al. 2004). The modeling approach we use here in- 
volves the CLF as the basic parameter to be extracted from 
the data and provides a consistent way to model both the 
galaxy LF and clustering statistics of the same galaxy sam- 
ple and a mechanism to extend the same underlying CLF 
model to describe galaxy statistics at higher redshifts. Since 
CLFs are recovered, we can easily integrate over the lumi- 
nosity to calculate halo occupation numbers allowing an easy 
comparison to previous analyses. Our approach also demon- 



strates why and how certain parameters related to CLFs 
are sensitive to LFs, such as those related to central galax- 
ies, while others, especially those involving satellite galaxies, 
can be determined better with the non-linear 1-halo part of 
the correlation function. This is consistent with suggestions 
in the literature, that, for example, halo occupation statistics 
— which are dominated by satellite galaxies — are better 
constrained with clustering of galaxies within groups (e.g., 
Coil et al. 2005; Collister & Lahav 2005). 

The paper is organized as follows: In the next section, 
we will outline basic ingredients in the empirical model for 
CLFs and how galaxy clustering statistics can be derived 
from CLFs. We refer the reader to Cooray & Milosavljevic 
(2005b) and Cooray (2005a) for initial discussions related to 
this empirical modeling approach and to Cooray (2005b) for 
details on the extension to higher redshifts. Previous studies 
related to the CLF, involving mostly the 2dFGRS catalog, 
are described in Yang et al. (2003b, 2005). In Section 3, we 
present a comparison of our model with the observed LF, 
and LFs of galaxy types, from SDSS and in Section 4, we 
extract parameters related to CLFs as a function of the red- 
shift from SDSS clustering at redshifts below 0.1 to Subaru 
data at redshifts ~ 4. In Section 5, we conclude with a sum- 
mary of our main results and implications related to the 
galaxy distribution and propose several new statistics that 
can help constrain CLFs better. Throughout the paper we 
assume cosmological parameters consistent with most obser- 
vational analyses of measurements modeled here and take 
f2 m = 0.3, £3a = 0.7, and a scaled Hubble constant of h — 1 
in units of 100 km s _1 Mpc -1 , unless otherwise stated ex- 
plicitly. 



2 CONDITIONAL LUMINOSITY FUNCTIONS: 
AN OVERVIEW 

In order to construct galaxy clustering statistics as a func- 
tion of redshift, we follow Cooray & Milosavljevic (2005b) 
and Cooray (2005) to define the redshift-dependent condi- 
tional luminosity function (CLF; Yang et al. 2003b, 2005), 
denoted by <1?(L|M, z), giving the average number of galax- 
ies with luminosities between L and L + dL that resides in 
halos of mass M at a redshift of z. As in our previous ap- 
plications, the CLF is separated into terms associated with 
central and satellite galaxies, such that 



3>(L|M, z) 
<S> ccn {L\M,z) 



$ sat (L|M,z) 



$ ccn (L\M,z) +$ sat (LjM, 2 ) 

/cen(M,«) 



27iTn(10)cr con L 

log 10 [L/L c (M,z)] 2 

2<7 C cn 



cxp < - 



A{M,z)L^ M) g B {L) 



(1) 



Here, / con (M, z) is a selection function introduced to ac- 
count for the efficiency for galaxy formation as a function of 
the halo mass given the fact that at low mass halos galaxy 
formation may be inefficient and not all dark matter halos 
may host a galaxy: 
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m (h~ 1 M syn ) 

Figure 1. The fraction of early- (red lines) and late-type (blue 
lines) galaxies, both appearing as central and satellite galaxies, 
relative to the total number of galaxies in dark matter halos, as a 
function of the halo mass. For reference, we also show the fraction 
of total central (dotted lines) and satellite galaxies (dashed lines). 
These fractions assume fiducial values for various model parame- 
ters, appropriate for SDSS galaxies with M r < —17, as discussed 
in the text. At halo masses below 10 13 Mq fractions are deter- 
mined by central galaxies; For low-mass halos the fraction of latc- 
type galaxies is close to 0.8, while the same fraction for high mass 
halos, dominated by satellites, is ~ 0.3. In addition to the halo 
mass, the early- and late-type fraction of satellite galaxies also 
depends on the galaxy luminosity. Here, we show the fractions 
for satellites with luminosity of 10 10 Lq. Later, based on param- 
eter constraints, we will highlight the satellite galaxy fraction at 
a given galaxy luminosity and show that, while some parame- 
ters such as the total satellite luminosity is well constrained, the 
fraction is not. 



io- 1 




-23 -21 -19 -17 



M (R-band) 

Figure 2. The LF of SDSS galaxies in the r-band from Blanton 
et al. (2003; 2004). We concentrate here only on galaxies with 
M r < — 17 as these form the sample used by Zehavi et al. (2004) 
for galaxy clustering measurements. In addition to the total lumi- 
nosity function — we show both the uncorrected and corrected 
estimates as filled symbols with error bars (see, Blanton et al. 
2004 for details) — we also show the LF of early- and late-type 
galaxies (open symbols). For clarity, we do not plot the error bars 
for the LF of galaxy types, but they are at the same level as that 
of the total sample. The curves show the predictions based on 
CLFs, with fiducial best-fit parameters as described in the text. 
The dotted lines show contribution from central galaxies, while 
the dashed lines show satellites. The solid lines show the total 
galaxy LF as predicted in this model. As shown, and discussed 
in Cooray (2005a), central galaxies dominate LF statistics; as we 
find later, parameters related to central galaxies are better deter- 
mined with LFs when compared to the information provided in 
clustering measurements. 



f ccn (M,z) = - 



1 + erf 



/ log(M) - log(M c , 



(*)) 



•(2) 



The motivation for the separation of galaxies to central 
and satellite galaxies is numerous: from theory, a better de- 
scription of the galaxy occupation statistics is obtained when 
one separates to central and satellite galaxies (Kravtsov 
et al. 2004), while from observations, central and satellites 
galaxies are known to show different properties, such as color 
and luminosity (e.g., Berlind et al. 2004). In our fiducial de- 
scription, we will take numerical values of M ccn „ cu t = 10 10 
M sun and a = 0.5. 

We introduced the selection function f ccn (M,z) in 
Eq. (2) in Cooray (2005a) to explain the faint-end slope 
of the 2dFGRS LF with a value of ~ -1.05 (Norberg et 
al. 2002). When considering model fits to the data with 
Af ccn _ cu t as a free parameter, we find that this parame- 
ter can only be constrained as an upper limit with SDSS 
data. As we discuss later, the lack of a clear constraint on 
A/c 0n _ C ut in our model fits differs from analysis based on 
halo occupation numbers where a minimum mass for the 
presence of galaxies in halos is usually suggested. The mini- 
mum mass in halo occupation number generally corresponds 



to the minimum mass for halos that host galaxies at the low- 
end of the galaxy luminosity distribution probed with the 
data and such a cut-off is naturally present in models related 
to CLFs. 

In Equation 1, L C (M, z) is the relation between central 
galaxy luminosity of a given dark matter halo and it's halo 
mass, taken to be a function of the redshift, while <r ccn , with 
a fiducial value of 0.17, is the log-normal dispersion in this 
relation. For an analytical description of the L c (M,z) rela- 
tion, we make use of the form suggested by Vale & Ostriker 
(2004) where this relation as appropriate for bj-band galax- 
ies today was established by inverting the 2dFGRS luminos- 
ity function given an analytical description for the sub-halo 
mass function of the Universe (e.g., De Lucia et al. 2004; 
Oguri & Lee 2004). The relation is described with a general 
fitting formula given by 



L c (M,z) = L (l + z) a 



(MjM\ 



[b+ (M/Mi) cd ( 1 +*)' ? ] 1 /d 



(3) 



For the rest B-band, the parameters have values of Lo = 
5. 7x 10 9 L , Mi = lO n M , a = 4.0, b = 0.57, c = 3.72, and 
d = 0.23 (Vale & Ostriker 2004; Cooray 2005a,b), while for 
SDSS r-band, we take Mi = 2 x lO n M and c = 3.78 with 
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Figure 3. Conditional luminosity functions today for a variety of masses as labeled on each of the four panels. The CLFs are divided to 
early- (red lines) and late-type (blue lines) galaxies, while for reference, we also show the total galaxy sample (black lines) with central 
and satellite galaxies shown with dotted and dashed lines, respectively. The CLF of high-mass halos are in good agreement with galaxy 
cluster LF, such as from Coma (Trentham & Tully 2002), that are neither fitted with a single Schechter function nor a simple power-law 
at the faint-end of the LF (Cooray & Cen 2005), though due to the M r < —17 cut-off in the measurements considered here, we ignore 
the latter complication. 



other parameters as above. The redshift evolution of this 
relation, based on high-redshift LFs, is discussed in Cooray 
(2005b). Following the analysis described there, where we 
constrained values for redshift- dependent parameters a and 
rj, we take fiducial values of -0.5 and -0.1; these were the 
best-fit parameters to the LFs of DEEP2 (Willmer et al. 
2005), COMBO- 17 (Bell et al. 2004), and rest B-band LFs 
of Gabasch et al. (2004). 

For satellites, the normalization A(M) of the satel- 
lite CLF can be obtained by defining L S (M, z) = 
Ltot(M,z) — L c (M,z) and requiring that L B {M,z) = 

minimum 

luminosity of a satellite is I/ m in- In the luminosity ranges of 
interest, and due to the numerical value chosen below for 
the slope 7, our CLFs are mostly independent of the ex- 
act value assumed for L m i n as long as it lies in the range 



(10 6 — 10 8 )-Lq. To describe the total luminosity of a halo, 
departing from the model used in Cooray (2005b), we make 
use of the following phenomenological form: 



L t ot(M,z) 



L c (M,z) 
Lc(M,z)( 



M 



M < Ms: 

M > M B: 



(4) 



Here, M sat denotes the mass-scale at which satellites begin 
to appear in dark matter halos with luminosities as cor- 
responding to those in the given sample of galaxies, while 
/3 s (z) is the correction to the power- law slope of the total 
luminosity-halo mass relation relative to that of the cen- 
tral galaxy-halo mass relation. We use this form since other 
parameterizations we considered resulted in unphysical sit- 
uations for certain parameter values in those descriptions, 
e.g., Ltot(Af) < L C (M), while other parameterizations did 
not provide useful constraints on parameters used for the de- 
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Figure 4. Halo occupation numbers. Left panel: For galaxies with absolute magnitudes between -18 and -23 in the SDSS r-band at 
redshifts < 0.1. Dotted lines show the central galaxy occupation number, dashed lines show the satellite occupation number, and the solid 
line show the total occupation number. Right panel: Redshift dependence of the halo occupation number, based on fiducial parameters 
for the L c (M,z) relation and Ltot(M, z) relation, as well as the redshift-dependent halo mass function, for galaxies with -18 and -22 in 
the r-band (as appropriate for SDSS at a redshift of 0.1) or B-band (for comparison with DEEP2 at a redshift of unity and GOODS 
at a redshift of 3.5). The occupation numbers suggest a power-law of ~ 0.75 for early-type galaxies and > 0.5 for late-type galaxies 
at the high mass end when dominated by satellites. Note also the transition from a dominant late-type fraction in central galaxies to 
a dominant early-type fraction at a halo mass around ~ 3 X 10 12 Mq, regardless of the redshift. When comparing halo occupation 
numbers determined from other data to the ones shown above, luminosity ranges of galaxies between different samples should be taken 
into account. Note that these occupation numbers are based on the fiducial model. When model fitting data, we find large degeneracies 
suggesting that the halo occupation slope is not well constrained by the observations. 



scription due to additional degeneracies. More importantly, 
the above form allows us to highlight easily an interest- 
ing result, involving the single parameter best constrained 
by clustering data, that we will discuss later. When show- 
ing models of CLFs in Figures 3 to 8, motivated by con- 
straints from that data that we will describe later, we take 
Af sat = 10 13 Mq and f3 s = 0.55 to describe r-band galax- 
ies with absolute magnitudes M r < —17. In Figures 11 and 
12, same numerical values for the parameters of the satel- 
lite CLF are also used at high redshifts and in the rest 
B-band, though we note a redshift-dependent variation in 
these parameters, especially when considering z — 4 clus- 
tering data from the Subaru Deep Field. Though we show 
figures with M sat = 10 13 Mq and f3 s = 0.55, this does not 
mean these are the best-fit values or our preferred values for 
these parameters. When we model fit the data, we will show 
constraints on these parameters explicitly and show that a 
rather large range of values is allowed by the data. While 
these two parameters are degenerate with each other, in ad- 
dition to SDSS data at z < 0.1, certain high- redshift data, 
such as COMBO-17 at z ~ 0.6 and Subaru/XMM-Newton 
Deep Field with clustering measurements at z ~ 4, do allow 
constraints to be placed on these parameters. 

While the above form refers to the total luminosity, 
when Ltot(M,z) > L c (M,z), this total luminosity must be 
distributed over a number of satellite galaxies in the halo 
when describing the satellite CLF. We take a power-law lu- 
minosity distribution and set j(M,z) = — 1 in Equation 1 
based on previous results derived on the CLF of galaxy 
groups and clusters (Cooray & Milosavljevic 2005b; Cooray 



2005a) and direct measurement in clusters such as Coma 
(Driver & De Propis 2002 where 7 = -l.OI+j};^); While the 
choice of 7 ~ —1 is motivated by the cluster LF, setting 7 
to a different value between -0.7 and -1.3, over a set of pa- 
rameter values we tested, did not change our results signif- 
icantly. Furthermore, for the maximum luminosity of satel- 
lites in a given halo, following the result found in Cooray & 
Milosavljevic (2005b) based on a comparison of predictions 
to the K-band cluster LF of Lin & Mohr (2004), we set 
imax = L c /2. A comparison to 2dFGRS CLFs as measured 
by Yang et al. (2005), however, suggested that such a sharp 
cut-off is inconsistent and that to account for scatter in the 
total galaxy luminosity, as a function of the halo mass, one 
must allow for a distribution in L max . Instead of additional 
numerical integrals, we allow for a luminosity dependence 
with the introduction of g s {L) centered around the maxi- 
mum luminosity of satellites such that $ sat (L|M) does not 
go to zero rapidly at Z/ max . By a comparison to the data, we 
again found a log-normal description with 



9s{L) = 



1 + erf 



log(L c /2.0) - log(L) 



(5) 



where a s = 0.3. The description here is such that /l = 1 
when L < Z/ mac = L c /2, but falls to zero at a luminos- 
ity beyond L c /2 avoiding the sharp drop-off at L c /2 with 
g B (L) — 1. Again, our results are mostly insensitive to pa- 
rameters of this description since variations here only lead 
to small changes to the overall CLF. 

The central galaxy CLF takes a log-normal form while 
the satellite galaxy CLF takes a power-law form in lumi- 
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Figure 5. Halo occupation numbers today as a function of the galaxy luminosity (as labeled on each of these plots). For reference, we 
divide the total occupation number to central (dotted lines) and satellite (dashed lines) galaxies. These are again based on our fiducial 
parameter description and these models are not unique to describe SDSS clustering data given large degeneracies between parameters. 
This is also clear from the fact that "best-fit" halo occupation models for same luminosity bins by Zehavi et al. (2004) suggest parameters 
that are distinctly different and involving even power-law slopes in mass greater than unity. 



© 0000 RAS, MNRAS 000, 000-000 



8 Cooray 




r (h 1 Mpc) r (h 1 Mpc) 



Figure 7. Projected correlation function of SDSS galaxies (from Zehavi et al. 2004). Left panel: For galaxies with M r < —20. Here, we 
show the prediction based on CLFs and variations associated with a change in the power-law slope of the total luminosity-halo mass 
relation. For reference, we also show the projected clustering power spectrum from the linear power spectrum alone, but scaled by the 
large-scale bias factor for galaxies with same luminosities. Right panel: Clustering of galaxy types and cross-clustering between galaxy 
types for the sample with M r < —21. For reference, we show both 2-halo and 1-halo contributions to the projected cross-correlation 
function between the two galaxy types. 



nosity. Such a separation describes the LF best with an 
overall better fit to the data in the K-band as explored by 
Cooray & Milosavljevic (2005b) and 2dFGRS 6j-band in 
Cooray (2005). Our motivation for log-normal distribution 
also comes from measured galaxy cluster LFs that include 
bright central galaxies where a log-normal component, in 
addition to the Schechter (1976) form, is required to fit the 
data (e.g., Trentham & Tully 2002). Similarly, the stellar 
mass function as a function of halos mass in semi-analytical 
models is best described with a log-normal component for 
central galaxies (Zheng et al. 2004). As we find later the over- 
all shape of the LF is strongly sensitive to the shape of the 
L c -M relation, and it's scatter, and less on details related 
to the Ltot-M relation. The non-linear part of the galaxy 
correlation function, or any clustering statistic, probes the 
satellite distribution and constraints can be put on the L to t- 
M relation. In fact, we find that the average luminosity of 
satellites, defined in Section 4, is the single parameter best 
constrained with current data. 

To describe galaxies as a function of color in this analyt- 
ical description, we must further divide central and satellite 
galaxies as a function of their color given the luminosity. 
Here, motivated by the bimodality of color (e.g., Baldry et 
al. 2004) that extends out to high redshifts (e.g., Giallongo 
et al. 2005), we consider models in terms of galaxy types. 
The description in terms of galaxy types is also useful since 
measurements at high redshifts, so far, involve the division 
of galaxy samples to two broad categories involving early- 
type, or red, and late-type, or blue, galaxies. Thus, in the 
case of early type galaxies we write the CLF as 



4> 



ear ' ly (L|M,z) = $ ccn (L|M, 2 )/ cal , y _ ccn (M,^) 



where the two functions that divide between early- and late- 
types are taken to be functions of mass, in the case of central 
galaxies, and both mass and luminosity in the case of satel- 
lites. These functions are described analytically as 



/"early — cen (-^^, 2-) — 



(7) 



/cea-E(z) 



1 + erf 



/ lQg(M) - l0g(M ce n(z)) 
V Nearly — ccn 



with fiducial parameters of M CC n(z) — 5 x 10 11 Mq, 
(Tcariy-ccn = 2.0, and / ccn _ E (z) = 0.6, and 

./"early — sat (M,L,z)= (8) 

3sat-E(z)s'(M, z) + g Bat -E(z)h(L, z) + /sat-E(.z), 

where, 



9 (M,z) = ± 1 + erf( lo g (M)-log(M Bat ( Z )) 

2 [ \ CTsat J 

h(L,z) = 1 1 + erf flogW-l°g(W*)Al 

2 [ \ CTsat / 

With Af sat = 10 13 Mq, Lsat = 3 X 10 9 L sun , CT sat = 1, 

/sat-E(z) = 0.4 and p sat -E(z) = 0.2; Early-type galaxies 
in the form of satellites varies from a fraction of / sa t- E (z) 
at low luminosity galaxies in low mass halos to 2(? sat _E(.z) + 
/sat-E(z) in halos with masses greater than 10 13 Mq. As 
fractions are defined with respect to the total galaxy number 
of a halo, late-type fractions are simply [1 — / C ariy-ccn(Af, z)] 
and [1 — /cariy-sat (M , L, z)\ for central and satellite galaxies, 
respectively and we do not need to specify there parameters 
separately. 

The fractions, following the fiducial values mentioned 
above — with some parameters estimated based on model 
fits to measurements described later — are shown in Fig. 1. 
The late-type fraction varies from ~ 0.8 at halo masses of 



$lf lly (L\M, Z ) = <T at (L|M,z)/ oarly _ sat (M,L, 2 ), (6) 
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Figure 8. Projected correlation function as a function galaxy absolute magnitude in the SDSS r-band (data from Zehavi et al. 2004). 
We show the predictions based on our fiducial model parameters. In addition to the total galaxy sample, when available, we also show 
the measurements, as well as predictions, for clustering of galaxy types. The CLFs associated with these predictions are shown in Fig. 3 
while the halo occupation numbers, based on an integration of the CLFs, are shown in Figure 4. 
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Figure 6. Galaxy bias as a function of SDSS r-band absolute 
magnitude as calculated from CLFs (solid line) with SDSS bias 
measurements shown with data points (from Zehavi et al. 2004). 
We also separate contributions from central galaxies (dotted line) 
and satellites (dashed line) to galaxy bias. We also show the bias 
for galaxy types (early- and late- type galaxies) . Late- type galaxies 
are expected to be in low-dense regions dominated by low-mass 
halos and their bias factor, relative to early type galaxies, would 
be lower. Satellite galaxies, regardless of the type, are in more 
massive halos and, thus, have higher bias factors relative to cen- 
tral galaxies. The average bias factor, shown here for the whole 
sample, however, is dominated by central galaxies due to the same 
reason that the LF is also dominated by central galaxies. 



10 Mq, in the form of central galaxies, to ~ 0.3 when 
M ~ 10 15 Mq corresponding to galaxy cluster scales, with 
the fraction essentially dominated by satellite galaxies. 



3 GALAXY LUMINOSITY FUNCTION 

Given the CLF, the galaxy LF is obtained through 

*((£,*)= / dM^(z)MMM,z), (10) 

where i represents the division to galaxy types. Here 
dn/dM(z) denotes the mass function of dark matter ha- 
los and we use the formalism of Sheth & Tormen (1999) in 
our numerical calculations. This mass function is in better 
agreement with numerical simulations (Jenkins et al. 2001) 
when compared to the Press- Schechter mass function (Press 
& Schechter 1974). Using our fiducial values for CLF param- 
eters, in Fig. 2, we show the SDSS galaxy LF (from Blan- 
ton et al. 2004) and the separation to early- and late-type 
galaxies. We only concentrate on galaxies with M r < — 17 
since this sample overlaps with galaxies used by Zehavi et 
al. (2004) for clustering measurements that are also used in 
the present analysis. The CLFs related to this description 
are shown in Fig. 3. At the faint-end, these CLFs flatten due 
to our assumption that the power-law slope of the luminos- 
ity distribution within halos is 7 = — 1, which is consistent 
with the LF of galaxies in clusters over the magnitude range 
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Figure 9. The three-dimensional cross-correlation function be- 
tween galaxies and dark matter as determined by Sheldon ct al. 
(2004) using SDSS galaxy-galaxy weak lensing measurements. 
The galaxy sample associated with this cross-correlation mea- 
surement a volume limited sample in redshifts between 0.1 and 
0.174 and in magnitudes between —23 < M r < —21.5. The vol- 
ume limited and luminosity-selected sample measurements allow 
an easy prediction based on the same fiducial parameters as those 
used to describe projected clustering measurements of galaxies. 
We also show the expected cross-correlation between mass and 
galaxy types. The dotted, dashed and dot-dashed lines show con- 
tribution from the central galaxy 1-halo term, the satellite galaxy 
1-halo term and from linear theory, scaled by a bias factor, re- 
spectively. 



of interest to this paper. At fainter magnitudes the CLF 
becomes complicated due to effects associated with the lu- 
minosity distribution of dwarf galaxies (e.g., Cooray & Cen 
2005). In the present analysis, we do not consider such low- 
luminosity galaxies with M r > — 16 and issues related to 
the subhalo mass function and associated substructure can 
therefore be easily ignored. 

In previous discussions of galaxy clustering under the 
halo model the occupation number has been widely used as 
a way to relate statistics of dark matter to galaxies (e.g., 
Kauffmann et al. 1999; Benson et al. 2000; Berlind et al. 
2003; see, review in Cooray & Sheth 2002). To compare with 
models of the halo occupation number, CLFs can be easily 
integrated such that 

N ccn (M,z) = / dL$ ccn (L\M,z) 



dL $ sat (i|M, z) 



(11) 



Since the halo occupation number captures no information 
on the luminosity distribution of galaxies, models involving 
the halo occupation number cannot be used to model the 
galaxy LF easily. We show the halo occupation number in 
Fig. 4 for -23 < M r < -18 galaxies in the SDSS sample 
(left panel) and redshift dependence of the halo occupation 
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Figure 11. Projected correlation function of galaxies at z ~ 1 as measured by the DEEP2 survey (Coil et al. 2004). Left panel: 
Clustering of galaxies divided into two luminosity samples with Mg < —19.75 (circles) and Mg > —19.75 (squares). The predictions 
based on CLFs are also shown; We assume a low-end magnitude of -18 for the faint sample, while no such assumption is needed at the 
bright-end due to the cut-off associated with the LF. Right panel: Galaxy clustering in the total sample divided to galaxy types. In both 
panels, dotted lines are predictions based on the linear theory power spectrum. 




Figure 13. Projected angular correlation function of LBGs at z ~ 4 as measured in the SubaruJ'XMM-Newton Deep Field (Ouchi 
et al. 2005). Left panel: Clustering of galaxies with i-band magnitudes brighter than 27.5, corresponding to rest-frame Mg < —18.5. 
For comparison, we also show expected clustering with the same luminosity cut at z ~ 5 and 6; High signal-to-noise ratio clustering 
measurements at such high redshifts are soon expected from Subaru and other deep drop-out surveys. At high rcdshifts, large-scale 
clustering increases due to the evolution in the halo clustering bias factor, but at the same time, non-linear clustering decreases as the 
number of galaxies that appear as satellites at a given luminosity begins to decrease at high redshifts. The dotted line shows the prediction 
based on linear theory at z ~ 4, scaled by the large-scale bias factor for galaxies with Mg < —18.5 (see, Cooray 2005b). Right panel: 
Galaxy clustering at z ~ 4, divided to luminosity bins (measurements from Kashikawa et al. 2005 using the Subaru Deep Field data) as 
indicated on the figure. At each of these luminosity bins, we assume same parameters related to satellites as the ones used to describe 
clustering for the sample in the left panel. The differences in the one-halo non-linear term prediction and small-scale observed clustering 
are due most likely due to variations in the parameters related to the satellite CLF as a function of satellite luminosity. For example, 
with M sat = 5 X 10 12 M and (3 S = 0.6, we over predict the small-scale clustering in the highest luminosity bin with Mg < —20.5, while 
small-scale clustering is well reproduced in the low luminosity bins. The excess in the high luminosity bin is due to the large number of 
satellites allowed, while a higher value for M sa t can be used to make the agreement with data better. This suggests that satellites with 
high luminosities appear in higher mass halos than compared to halos in which low luminosity galaxies appear; This is clearly consistent 
with the general expectation. A detailed analysis of luminosity dependent clustering at z ~ 4, using a new set of measurements from the 
imaging data in Ouchi et al. (2005), will be described in detail elsewhere (Cooray & Ouchi, in preparation). 
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Figure 10. Projected correlation function of galaxies at z ~ 0.6 
as measured by the COMBO-17 survey (Phleps et al. 2005) and 
divided into clustering of early- and late- type galaxies. The pre- 
dictions based on our fiducial model description, with appropriate 
parameters for redshift evolution of the L C (M, z) relation, are also 
shown. In the case of early-type galaxies, we also show variations 
in the power-law slope of the total luminosity-halo mass relation. 
While not specified as part of the observations, we have assumed 
this sample corresponds to Mg < — 18 when model fitting the 
data. 

number (right panel). At high redshifts, the occupation num- 
bers are at the B-band since galaxy samples in COMBO-17, 
DEEP2, and GOODS are defined at this wavelength. While 
Subaru/XMM-Newton Deep Field sample is defined in the 
observed i-band, we assume rest-frame B-band luminosities 
when model fitting the data. 

In the case of satellites, since A r sat (M, z) = 
A(M,z) J dLL 7(M) / s (L) at the high mass limit of the 
halo mass with j(M) a constant and when L to t(M,z) 3> 
L c (M,z) we expect N sat (M) oc L to t(M,z) ~ M ps+a , where 
/3 S is the slope introduced in equation 4 and a is the slope of 
the L C (M) relation at the same halo masses. With a s ~ 0.2, 
and p s = 0.55, the fiducial slope of occupation number is 
around 0.75, though this slope is mass dependent given the 
rapid variation of central galaxy luminosity with halo mass. 

In Fig. 5, we present the halo occupation number as a 
function of luminosity considered by Zehavi et al. (2004) for 
clustering measurements. These occupation numbers, based 
on CLFs, can be compared with best-fit halo occupation 
models suggested in Zehavi et al. (2004). In our fiducial 
model, satellites with M r < — 17 only appear in halos with 
masses greater than M sat = 10 13 Mq . We see a cut-off in the 
halo occupation number of central galaxies at masses around 
10 11 Mq. At M r < -20, this cut-off is around ~ 7 x 10 11 
Mq . This value can be compared to the suggested minimum 
value of ~ 10 12 Mq for the halo occupation number down 
to the same magnitude in Zehavi et al. (2004). The differ- 
ence can be understood based on the fact that Zehavi et 
al. (2004) description of the satellite halo occupation num- 
ber is (M/Mi) a with a no cut-off at a lower mass, while the 




9 (arcsec) 



Figure 12. Projected angular correlation function of galaxies 
at z ~ 3 as measured by the GOODS survey (Lee et al. 2005). 
The measurements are for the total sample, but for comparison, 
we also show the expected clustering of red- and blue-galaxies if 
the sample had been divided to galaxy types. For comparison, we 
also show the 1-halo contribution. 

minimum halo mass cut-off only applies to the central galaxy 
occupation number. It could be that the degeneracy between 
the central and satellite galaxy occupation numbers leads to 
an overestimate in the minimum mass for central galaxies 
to appear, while that overestimate is partly accounted with 
an increase in the slope of the halo occupation number for 
satellite galaxies. 

As stated in Zehavi et al. (2004), the halo occupation 
model parameters suggested there are not unique. The mass 
cut-off detected based on the halo occupation number model 
fits to galaxy statistics should not be treated as a general 
lower limit on halo mass to host galaxies. The cut-off usu- 
ally one detects with occupation numbers is the minimum 
halo mass to host a galaxy given the minimum luminosity of 
galaxies in the sample under consideration (for example, the 
minimum mass of the central galaxy halo occupation num- 
ber as a function of luminosity in Fig. 5). It could be that 
halos with a mass lower than the cut-off continue to host 
galaxies, but with a lower luminosity, and due to sample 
selection criteria such halos would not be included in the 
sample used for clustering studies. We will return to this 
below in the context of model fits to clustering data where 
we find no conclusive evidence for a general minimum halo 
mass to host galaxies, for galaxies with M r < — 17. 

In addition to the low mass cut-off of central galaxies, 
we also have the freedom to select a low mass for the ap- 
pearance of satellites with the parameter M sat . In Figures 4 
and 5, we have set M sat = 10 13 Mq, though best-fit halo 
occupation numbers from Zehavi et al. (2004) suggest the 
presence of satellites, as appropriate for the same sample of 
galaxies, in halos with a lower mass than this. While this 
could be due to differences between the model, as stated be- 
fore, the occupation models as well as our CLFs may not 
be unique. Later, we will use data to constrain parameters 
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Figure 27. The predicted cross-correlation between SDSS faint 
and bright samples of galaxies. For comparison we show the clus- 
tering of galaxies in each of the luminosity bins (from Zehavi et 
al. 2004), but cross-clustering between luminosity bins is yet to 
be measured. We propose such a measurement as a way to im- 
prove constraints on parameters related to the satellite CLF of the 
fainter sample. In addition to cross-clustering between galaxies in 
separate luminosity bins one can also consider cross-clustering 
between galaxy types (shown as the upper dot-dashed line for 
clustering between early type galaxies in the brighter sample and 
late- type galaxies in the fainter sample). A complete set of such 
measurements across several luminosity bins, in the form of a co- 
variance matrix of cross-correlations C(L l a , L 3 b ,r), will provide all 
the information related to galaxy clustering at the two-point level 
and will provide additional information to constrain parameters 
related to the galaxy type CLFs. 



such as M sat and find large degeneracies between j3 a , the 
power-law slope, and M sat such that as M sat is lowered, (3 a 
is increased. The same degeneracy should also appear in 
model fits based on the halo occupation number. For exam- 
ple, with a larger value for M m i n , the minimum mass for the 
central galaxy halo occupation number as in model descrip- 
tions of Zehavi et al. (2004), one should find a larger slope a 
for the satellite halo occupation number such that the total 
number of satellite galaxies remains the same; This behavior 
could partly explain the unusually large values for the slope 
suggested in Zehavi et al. (2004). The degeneracy between 
M sa t and j3 B suggests that a single parameter involving the 
combination of these two parameters can be best determined 
with the data. As we find later, this parameter is the total 
luminosity of satellite galaxies averaged over the halo mass 
distribution that hosts galaxies between — 17 > M r > — 23 
in the SDSS sample. 



4 GALAXY CLUSTERING WITH CLFS 

Using the CLF, instead of the halo occupation number, we 
can write the power spectrum of galaxies between type i and 
type j in terms of the 1- and 2-halo terms (see, review in 



Cooray & Sheth 2002) at a redshift z as 

pii l (k\L, Z )=pii(k\L, Z )+pii(k\L,z) ! 



where 



PH(k\L,z) = - 



m(L, z)nj{L, z) 



dM 



dn(z) 
dM 



$r*(i|M, z)&?(L\M, z)u 2 ml (k\M, z) 



+ m° n (L\M, z)&?(L\M, z) %a i(fc|M, z) 
+ *f n (L|M, z)®T\L\M, z)u gal (k\M, z) 



and 

Pg(k\L,z)=P(k,z) 



i? n (k\L,z)irm, Z ) 



+ ir u {k\L,z)Ij{k\L,z) + ir t {k\L,z)I^ n {k\L,z) 
+ ir\k\L,z)I™\k\L,z) 
with the integrals I ccn (k\L, z) and J sat (fcjL, z) given by 
ir(k\L,z) = 

lT\k\L,z) = 



(12) 



dM ,;/ &i(M,z) _ r and 



dM 



rii(L, z) 



,^dn(z), ,$f at mM,z) ,,.„, 

dM —r^-bi{M,z) _ 1 f^-M g al(ifc|M, 



dM 



rti(L, z) 



respectively. Here, and above, 
dn(z 



m(L, z) = 



dM- 



dM 



(L\M,z) + ^T t (L\M,z)] (14) 



denotes the mean number density of galaxies of type i while 

2 sin kr p ga i(r|M, z) 



Ugal(ft|M, z) 



i i r 47rr 



kr 



M 



(15) 



denotes the normalized Fourier transform of the galaxy den- 
sity distribution within a halo of mass M when bi (M, z) is 
the first-order bias factor of dark matter halos. Here for dark 
matter halo bias we use the bias factor derived by Sheth, Mo 
& Tormen (2001) which corrects earlier calculations by Mo 
et al. (1997; Efstathiou et al. 1988; Cole & Kaiser 1989) 
based on spherical collapse arguments. 

The standard assumption in above equations is that 
galaxies trace dark matter within halos such that one can 
utilize the dark matter distribution given by analytic forms 
such as the NFW (Navarro et al. 1996) profile. An improved 
approximation will be to use the density distribution de- 
fined by sub-halos to describe galaxies and, instead of the 
halo mass function, use a combination of halo mass func- 
tion and the subhalo mass function to describe the satellite 
contribution to galaxy clustering that also accounts for ef- 
fects associated with substructure (e.g., Sheth & Jain 2002). 
Even if corrections exist for the power spectrum from the 
subhalo mass function, these only modify the strongly non- 
linear regime and leave the transitional regime from linear 
to non-linear clustering probed by current galaxy clustering 
measurements unaffected. Since relevant profiles related to 
substructure is still not well studied numerically we make 
use of the NFW dark matter density profile (Navarro et al. 
1997) to describe the galaxy distribution within halos. The 
concentration parameter is defined following the scaling re- 
lation of Bullock et al. (2001). The relevant expressions in 
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Figure 14. Constraints on parameters cr ccn , the log-normal dispersion of the central galaxy — halo mass relation, and M — cen — cut, 
the lower halo mass to host a central galaxy, independent of luminosity, related to the central galaxy CLF description. The left panel 
shows the constraint based on the SDSS LF (from Blanton et al. 2004), down to M r of -17, and the right panel shows the constraints 
from SDSS galaxy clustering measurements (from Zehavi et al. 2004). While the LF strongly constrains these parameters, clustering 
measurements do not. The difference comes from the fact that clustering measurements are more sensitive to satellite galaxies while, as 
shown in Figure 2, LF measurements are sensitive to statistics of central galaxies. 
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Figure 15. Constraints on parameters / con _ e, the fraction of early-type central galaxies at the high mass-end and M ce n related to the 
analytical description of the early-type galaxy fraction of central galaxies. The left panel shows the constraint based on SDSS galaxy type 
LFs (from Blanton et al. 2004), down to M r of -17, and the right panel shows the constraints from SDSS galaxy clustering measurements 
divided to galaxy types between magnitudes bins from -18 to -21 (from Zehavi et al. 2004). As above, while LF strongly constrain 
parameters related to central galaxies clustering measurements do not. 



our calculation are summarized in Cooray & Sheth (2002). 
In a future paper we plan to combine galaxy-galaxy lensing 
measurements with galaxy clustering measurements to test 
the extent to which galaxies trace dark matter. For now, 
we will ignore any differences in the galaxy profile relative 
to dark matter and concentrate only on basic parameters 
related to the CLF rather than statistics such as profiles. 

In Equation 1121 when i = j, the expression reduces to 
the power spectrum of galaxies of the same galaxy type. 



Similarly one can ignore the index i and j and replace the 
CLF with the total CLF to calculate the power spectrum of 
the total galaxy sample at a given luminosity. Furthermore, 
one can also consider the cross power spectrum of samples 
between (Li,i) with (L2, j), where i and j denote the type, 
but instead of at a fixed luminosity, cross-correlations are 
considered between different luminosities. In this case, the 
above expressions must be generalized for the case with two 
different luminosity bins. Since this is straightforward, we 
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Figure 16. Constraints on parameters that describe galaxy types related to the satellite CLF (see, equation[IJ based on galaxy clustering 
data divided to galaxy types between —18 > M r > —22 in SDSS. While M sa t is not strongly constrained with clustering data, / sa t— e 
describing the fraction of early type galaxies as satellites in low mass halos is constrained to be between 0.5 ±0.15 at the 68% confidence 
level. The right-panel shows constraints on parameters / sa t_ e and g S at— E- 



do not reproduce the appropriate expressions here. These 
cross-correlation measurements between two different lumi- 
nosity bins and different galaxy types across those bins are 
yet to be measured. These measurements provide the full set 
of clustering measurements related to galaxies and can be 
thought of as a covariance matrix of the form C(L' a , L J b ,r) 
where a and b are indices over the luminosity bins and i 
and j are indices over the galaxy types, while r is the pro- 
jected length at which clustering is measured. Towards the 
end of our discussion, we will motivate such a full set of 
measurements as a way to establish the satellite CLF more 
accurately. 

For reference, to compare with lensing-lensing galaxy 
measurements, the cross-power spectrum between galaxies 
of type i and the dark matter distribution is 



P S -i(k\L,z) - 



1 



Pt\k\L,z) 



rn{L, z) 



dM 



2h 

M dn(z) 
~J dM 



where 



<br\L\M,z)ul &l {k\M,z) 



+ t>r{L\M,z)u^{k\M,z) 



and 

P^ i (k\L,z)=P(k,z) 



ir(k\L,z)I s (k, z) 



+ Ii(k\L, z)I (k, z) 
with the integral I s (k, z) given by 



I\k,z) 



dM—^^hiM, z)u gal (k\M, z) 
p dM 



(16) 



(17) 



and p is mean comoving density of dark matter. 

At large scales, the galaxy power spectrum or the 
cross-power spectrum, reduces to that of the linear power 
spectrum scaled by the constant galaxy bias factor(s). 



One can understand this by noting that at large scales, 
u sa .l(k\M, z) — » 1 and the galaxy power spectrum simplifies 
to 



P gal (fcjL, z) « h{L, z)bj{L, z) P(k, z), 
where 



(18) 



bi(L,z) = 

dn(z) 



dM- 



dM 



bi{M,z] 



(19) 

[$f n (L\M, z) + $f \L\M, z)] 
ni(L, z) 



is the mean large-scale bias factor of the i-type galaxy pop- 
ulation. This large-scale bias factor has already been dis- 
cussed using CLFs previously (see, Cooray & Milosavljevic 
2005b; Cooray 2005a,b) and we summarize results based on 
the current analysis in Fig. 6. 

Given the power spectrum, the three-dimensional cor- 
relation function of galaxies of type i and j with luminosity 
L at a redshift of z is 



£ij(r\L,z) = 



k2dk Pl ML,z) Sin{kr) 



2tt 2 



kr 



(20) 



Given limited statistics, most measurements are averaged 
over samples of galaxies distributed over a certain redshift 
range. In this case the projected correlation function follows 



«p (r\L, 



kdk 
i 

2tt 



\L,z)J (kr) 



(21) 



In the case of a broad redshift distribution of galaxies over 
which clustering is projected the same correlation function 
is generally written in terms of angular scale, 8, with the 
correspondence r = OdA, where dA is the comoving angu- 
lar diameter distance. To calculate such a broad correlation 
function in redshift space, we average over the galaxy red- 
shift distribution associated with clustering measurements 
such that 
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Figure 17. Constraints on parameters /3 S , the additional power-law slope of total luminosity-halo mass relation (in addition to the slope 
of central galaxy-halo mass relation), and M sa t, the halo mass scale at which satellites begin to appear, related to the satellite CLF. 
These constraints come from clustering measurements from SDSS (top left), COMBO-17 (top right), DEEP2 (middle left), GOODS 
(middle right), and Subaru/XMM-Newton Deep Field (bottom panel) at rcdshifts less than 0.1, around 0.6, around unity, between 2.5 
and 3.5, and at 4, respectively. We only make use of total clustering data divided to galaxy luminosity bins here, but the constraints 
shown above are for the combine data set at each of the redshifts. In later figures, we will highlight differences between certain luminosity 
bins instead of the overall constraint shown here. 
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Figure 18. (L S at), the sample-averaged luminosity of satellites for the given sample of galaxies ("equation 1241 . as a function of /3 Bt the 
power-law slope of total luminosity-halo mass relation, and Aisat, the halo mass scale at which satellites appears. The dashed lines show 
the average satellite luminosity for SDSS sample while dotted lines show the same at z ~ 3 as appropriate for the GOODS survey. For 
reference, we overlap constraints on this parameter space from SDSS (same as Figure 17, top left panel). The degeneracy in f) a — M sa t 
plane traces contours of equal average satellite luminosity and this single parameter is best constrained by current galaxy clustering 
measurements . 



wl j (e\L,z) = J drn 2 (r) J —P tJ (k\L, z)J {kd A 0) , (22) 

where n(r) is the normalized radial distribution of galaxies 
with J drn(r) — 1. 

In our model predictions we calculate the projected cor- 
relation function at the mid point of the redshift distribu- 
tion of galaxies used in that sample. The measurements 
where the exact redshift distribution plays an appreciable 
role is those of the GOODS survey (Lee et al. 2005) and 
the Subaru Deep Field (Ouchi et al. 2005) since galaxies are 
broadly distributed over a wide range in redshift from 2.5 to 
4.5 and from 3.5 to 4.5, respectively. Fortunately, for both 
these surveys the expected redshift distribution of galaxies is 
known either through Monte Carlo simulations, in the case 
of GOODS (Lee et al 2005), or, in the case of Subaru Deep 



Field, through a combination of spectroscopic redshift mea- 
surements and Monte Carlo estimates (Ouchi, private com- 
munication). We take these distributions into account when 
model fitting GOODS and Subaru w p (8) measurements. 

Another uncertainty in some of these measurements is 
the exact luminosity distribution of galaxies in the sample. 
For surveys such as SDSS and DEEP2 galaxy luminosities 
are a priori known and samples are binned in luminosity, 
while for surveys such as GOODS and Subaru Deep Field, 
the exact luminosity distribution remains uncertain, though 
statistics in terms of the apparent magnitude at the observed 
wavelength. As appropriate, given the redshift information, 
we converted some of the suggested apparent magnitudes of 
galaxies in the sample to rest-frame luminosities at the ob- 
served wavelength, usually in the rest B-band, and used that 
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Figure 19. Left panel: (L sa t}, the sample-averaged luminosity of satellites as a function of f3 B and M aa t at z = 4 for galaxies with 
Mg < —18.5. For reference, we overlap constraints on this parameter space from Subaru (same as Figure 19, bottom panel). Right panel: 
n(z), the number density of galaxies at z ~ 4 with Mb < —18.5, as a function of as a function of f) s , the power-law slope of total 
luminosity— halo mass relation, and M sa t, the halo mass scale at which satellites appears (in units of 10 -3 h^ Mpc - 3 ). For reference, we 
overlap constraints on this parameter space from Subaru/XMM-Newton Deep Field (same as Figure 17, bottom panel). The degeneracy 
in fis-Msnt plane also traces essentially contours of equal number density of galaxies as well as equal values for (L Ba ,t). 




Figure 20. Constraints on parameters /3 S , the additional power-law slope of total luminosity-halo mass relation (in addition to the 
slope of central galaxy— halo mass relation), and M sa t, the halo mass scale at which satellites begi n to appear, related to the satellite 
CLF from Subaru/XMM-Newton Deep Field at z = 4 compared with constraints from SDSS (left panel) and COMBO-17 (right panel). 
At the lcr confidence, a clear difference between constraints at z ~ 4 and at z ~ 0.6 is clear. In the text, we discuss this difference in the 
context tha t we find no difference in the redshift dependence of (L aa t). 



information to establish the minimum luminosity of galax- 
ies in the sample. The minimum luminosity usually plays a 
larger role while the maximum luminosity of galaxies in the 
sample does not due to the bright-end exponential cut-off in 
the galaxy LF. 

In Fig. 7, we show the projected correlation function 
of SDSS galaxies with M r < -20 from Zehavi et al. (2004) 
and a comparison to model predictions based on the CLF 
using the fiducial description of model parameters. The cor- 
responding CLFs of these model fits are in Fig. 5. For refer- 



ence, Fig. 7 left panel illustrates the dependence of projected 
correlation function when the power-law slope of the total 
luminosity-halo mass relation is varied with M sat fixed at 
the same fiducial value. In general, an increase in (3 3 can be 
compensated by an increase in M sat . This degeneracy will 
become clear when we study model fits to the data later. In 
the right panel of Fig. 7, we illustrate the projected correla- 
tion function of galaxy types as well as the cross-correlation 
between two galaxy types with M r < — 21. The presence of 
a non-linear part for the cross-correlation between galaxy 
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Figure 21. The constraints on M sa t-/3 S plane at two different luminosity bins: dotted lines show the constraint when —18 > M r > —19 
and solid lines show the case for —21 > M r > —22. We find some evidence for an increase in M sa t as the galaxy luminosity is increased, 
but the exact dependence between M sa t and galaxy luminosity is not well established with current data. 



types can be easily understood based on the fact that both 
early- and late-type exists in similar mass halos (Zehavi et 
al. 2004). 

In Fig. 8, we summarize the projected correlation func- 
tion as a function of luminosity bins considered by Zehavi et 
al. (2004); For the faintest (-17 to -18) and the brightest (- 
22 to -23) bin, only the total clustering correlation function 
is measured, though for comparison, we continue to show 
the clustering correlation function for both early- and late- 
type galaxies. Note that with our fiducial model parameters, 
measured projected correlation functions in magnitude bins 
between -19 to -20, -20 to -21, and -21 to -22 are generally 
well described, while fits are generally less than perfect in 
both the low luminosity and high luminosity bins. This is 
due to the fact that our fiducial model parameters are ex- 
tracted from an overall fit to the whole sample assuming 
the same underlying description for the CLF for the whole 
galaxy sample. When model fitting the data, since measure- 



ments in mid magnitude bins are better determined, the fits 
are weighted more for these bins than ones at the two ends. 
We did not attempt to weight different bins equally. At this 
initial stage of analysis we are mostly interested in extract- 
ing a consistent model for the overall CLF of galaxies from 
current measurements or trying to understand the extent 
to which data can constrain parameters related to the CLF. 
The models considered in Zehavi et al. (2004) involved differ- 
ent occupation number descriptions for different luminosity 
bins. The CLF approach avoids having to describe occupa- 
tion numbers separately for different luminosity bins, though 
it is likely and, guaranteed to be, that some parameters such 
as M sa t will be luminosity dependent, though parameters 
such as /3 S should not be. We will also show results where we 
model fitted the data separately based on divisions to lumi- 
nosity bins. While the overall fits are not strong, we do find 
certain limited evidence for variation in M sat as a function 
of the luminosity. While the CLF description leads to a cer- 
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Halo Mass (h~ 1 M sun ) Halo Mass (h~ 1 M sun ) 

Figure 22. Left: The minimum mass for the appearance of satellite galaxies at a given luminosity as a function of the halo mass based 
on a catalog of galaxy groups and clusters in the SDSS (described in Weinmann et al. 2005). Here, each data point represents a group or 
a cluster where the the halo mass was determined based on the total luminosity of the halo. Each data point represents the luminosity 
of the faintest galaxy assigned to each halo. Here, we focus on the r'-band luminosity as the sample used for galaxy clustering in Zehavi 
et al. (2004) is defined for that sample. The solid line shows the relation established from this catalog between the minimum luminosity 
in the r'-band and the halo mass: Mr{min) « -19.8(M/10 12 M ) 03 . Right: The maximum luminosity of a halo as a function of the 
halo mass. Here, we plot the luminosity of the brightest galaxy assigned to that halo (which may or may not the central galaxy in terms 
of cluster/group dynamics). The long-dashed line is the same relation as that in the left panel. The solid line is the relation between 
central galaxy luminosity and the halo mass as described in equation (3) at z = as required to explain the SDSS luminosity function 
from Blanton et al. (2004). The mass estimate is highly uncertain when halo masses are below few times 10 12 Mq due to the small 
number of galaxies and the scatter in the luminosity of the dominant galaxy. Due to this, and the uncertain assignment of galaxies that 
are satellites of a bigger halo to less massive halos, we do not consider the difference at the low mass end between the brightest galaxy 
luminosity and the expected luminosity from the L C (M) relation to be any concern. 



tain reduction in the number of parameters to be determined 
from data, though we note that, due to our introduction of 
new parameters involving galaxy types etc, there is in fact 
no reduction but rather an increase in parameters. Later, 
in the discussion, we will propose additional measurements 
related to the same sample of galaxies in SDSS as Zehavi et 
al. (2004) and these measurements could further aid in im- 
proving model fits to determine current parameters better. 

To show that our models are consistent, in Fig. 9, we 
compare our prediction for the galaxy-mass cross-correlation 
function, in real space, for a volume limited sample with 
—21.5 > M r > —23, and in the redshift range between 0.1 
and 0.174. This galaxy sample has been used by Sheldon et 
al. (2004) to make a measurement of the galaxy-mass cross- 
correlation function via galaxy-galaxy weak lensing mea- 
surements in SDSS. We find our predictions agree well with 
measurements, and as a further application, in Fig. 9, we 
also show the expected cross-correlation if the foreground 
galaxy sample of Sheldon et al. (2004) had been further 
divided to blue- and red-galaxies, following essentially the 
same division to galaxy types as in Zehavi et al. (2004). 

In Cooray & Milosavljevic (2005a) we made use of SDSS 
galaxy-galaxy weak lensing measurements in the z'-band to 
construct L C (M) relation at higher wavelengths. These mea- 
surements are analyzed using the halo model in a variety of 
studies (e.g., Mandelbaum et al. 2004; Guzik & Seljak 2002; 
Yang et al. 2003a; Sheldon et al. 2004) and we do not make 



use of the galaxy-mass correlation function when model fit- 
ting parameters here. This is due to the fact that we are 
primarily interested in understanding the extent to which 
CLFs can be constructed from galaxy clustering measure- 
ments and to check our estimates, say, on the halo mass of 
galaxies at a given luminosity from estimates made by prior 
studies using galaxy-galaxy lensing measurements. We do 
this in the context of probability distribution for halo mass 
at a given galaxy luminosity (Mandelbaum et al. 2004) . 

The approach based on CLFs easily allows us to model 
clustering statistics at high redshifts within the same pa- 
rameter description provided that redshift dependences are 
properly taken into account. Given the results from Cooray 
(2005b) on the redshift evolution of the L C (M) relation, here 
we take the redshift dependence of the central galaxy lumi- 
nosity with redshift into account with parameters a and r\ 
in equation 5. For parameters in the satellite CLF, such as 
P„ and M sat , we do not attempt to include redshift varia- 
tions given the lack of knowledge. On the other hand, red- 
shift dependences can be extracted by analyzing clustering 
measurements as a function of redshift and by looking for 
differences in parameters constrained at different redshifts. 
This was the approach used in Cooray (2005b) to establish 
redshift variation in L C (M) relation. 

In Fig. 10, we compare our predictions for projected 
galaxy clustering at redshifts 0.4 to 0.8 as determined by 
the COMBO-17 survey (Phleps et al. 2005). These data in- 
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Figure 23. Fraction of satellites as a function of the luminosity bin in r-band, as labeled on each of the four panels, as a function of 
/3 S , the power-law slope of total luminosity— halo mass relation, and M sa t, the halo mass scale at which satellites appears. For reference, 
we overlap the constraints on this parameter space from SDSS as relevant for each of the luminosity bins; Note that these constraints 
are worse than the overall constraint on this plane when the galaxy sample is combined. Satellite fractions range from 0.05 to 0.15, 
when —22 < M r < —21 to ~ 0.1 to 0.5 when —19 < M r < —18 at the 68% confidence level. These fractions are consistent with values 
suggested in Mandelbaum et al. (2004) in the three low-luminosity bins based on an analysis of galaxy-galaxy lensing data with numerical 
simulations. 



volve rest B-band magnitudes and we make use of the L C (M) 
relation as appropriate for rest B-band from Cooray (2005b) 
including the redshift evolution with parameters described 
with respect to equation 3. While our fiducial parameters de- 
scribe the non-linear clustering part of early- and late-type 
galaxies in this sample well, we find that large-scale clus- 
tering of late-type galaxies is not modeled by our parame- 
ters. We find the same difference between measurements and 
model fits based on the halo occupation number appears in 
the analysis by Phleps et al. (2005) as well. We use this data 
set to extract parameters related to the satellite CLF and 
find that constraints on f3 s and M sat allowed by COMBO-17 
at a mean redshift of 0.6 is in good agreement with SDSS 
suggesting that no strong evolution of parameters such as 
/3 S and M sa t out to this redshift. 



In Fig. 11, we consider galaxy clustering at z ~ 0.8 to 
1.3 from the DEEP2 survey (Coil et al. 2004). Here, clus- 
tering measurements are divided to two-luminosity bins, in 
the rest B-band, and the combined sample to early- and late- 
types. As shown in Fig. 11, our fiducial model parameters re- 
lated to central and satellite CLFs describe DEEP2 cluster- 
ing measurements at z ~ 1 reasonably well. Unfortunately, 
DEEP2 data mostly probe large-scale linear clustering pat- 
tern rather than the non-linear 1-halo part that is strongly 
sensitive to model parameters related to the satellite CLF. 
As we find later, because of this reason, DEEP2 data only 
allow upper limits to be placed on model parameters such 
as /3 a and M sat at a redshift of unity. Since measurements 
considered here only come from the first subset of the total 
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Figure 24. Probability distribution for the fraction of satellites as a function of the luminosity bin in r-band, as labeled on each on 
the left panel. Left panel: Satellite fraction with /3 B taken as a uniform prior between and 1. Right panel: Satellite fraction with the 
constraint that 0.4 < /3 S < 1. The lower estimate was taken to be roughly consistent with the minimum luminosity — halo mass relation 
suggested by the SDSS galaxy group catalog data (Fig. 22), combined with clustering constraints shown in Fig. 21 and 23. 




Mass (h~ 1 M sun ) Mass (h~ 1 M sun ) 

Figure 25. The conditional probability distribution function of halo mass P(M\L, z) to host a galaxy of the given luminosity and at 
the given redshift as a function of the halo mass. The black lines are the total galaxy sample while red and blue lines show the sample 
divided to early- and late-type galaxies. Left- and right-panels show these probabilities for M r or Mb magnitudes between -18 and -19, 
and between -22 and -23, respectively at redshifts of 0.1 (in r-band), 3 and 5 (in B-band), in decreasing thickness of lines. 



DEEP2 galaxy sample, the final clustering analysis should 
improve parameter estimates significantly. 

Extending to higher-redshifts, we make use of the rest 
B-band clustering measurements at z ~ 3 by Lee et al. 
(2005) in the GOODS survey. Due to limited number statis- 
tics, measurements only exist for the total galaxy sample 
though in Fig. 12 we also show the clustering of early- 
and late-type galaxies as well. At z ~ 4, the recent an- 
gular clustering measurements by Ouchi et al. (2005) in 
the Subaru/XMM-Newton Deep Field can also be mod- 
eled using the CLF approach. In Fig. 13 (left panel), we 



show the measurements with i-band magnitudes brighter 
than 27.5. This magnitude limit roughly corresponds to the 
rest Mb < —18.5, and this conversion is consistent with 
the galaxy number density expected from the rest B-band 
galaxy LF at a redshift of 4 (with a number density of 
5 x 10" 3 h 3 Mpc -3 from Cooray 2005b) and the suggested 
number density of 5.8 ± 1.4 x 10~ 3 h 3 Mpc" 3 in Ouchi et 
al. (2005: see their Table 1) down to the same magnitude 
limit. To describe non-linear clustering at these redshifts, 
the satellite CLF must have distinctively different parame- 
ters for the slope /3 a and the low mass cut-off M aa t for the 
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Figure 26. The conditional probability distribution function of halo mass P(M\L, z) to host a galaxy of the given luminosity at a given 
redshift as a function of the halo mass. The four panels show these probabilities at different redshifts as labeled on each of the panels, 
while the plotted curves are for magnitudes between [-19,20], [-20,-21] and [-21,-22], in r-band at a redshift of 0.1 and B-band for other 
redshifts, with probabilities shown separately for central (dotted lines) and satellite (dashed lines) galaxies. These probabilities based on 
CLFs can be compared with the same probabilities extracted from an analysis of SDSS galaxy-galaxy lensing data in Mandelbaum et 
al. (2004; see, their figures 3 and 4) using numerical simulations. 



appearance of satellites when compared to parameters. In 
Figure 1, we show the expected clustering level based on 
best-fit parameters that we will return to below. For com- 
parison, in the same figure, we also show expected clustering 
of galaxies down to the same magnitude level at redshifts 5 
and 6. At large angular scales, as the redshift is increased, 
clustering strength is expected to increase due to redshift 
evolution of bias factor, which is in return associated with 
the decreasing number density of halos that host the galax- 
ies with the same luminosity when compared to the number 
density at a lower redshift. At small scales corresponding to 
the 1-halo term, galaxy clustering should show a decrease 
in strength as the number of galaxies that appear as satel- 
lites with the same luminosity is decreasing as the redshift 
is increased. 



In Fig. 13 (right panel), we consider clustering as a func- 
tion of the galaxy luminosity at z ~ 4. The measurements 
shown here now come from Kashikawa et al. (2005) based 
on clustering measured with the Subaru Deep Field 1 . To 
describe luminosity dependent galaxy clustering we use the 
same CLFs as the ones used to describe galaxy clustering 
at z ~ 4 in the right panel, but divided the absolute lu- 
minosities of galaxies following the division in Kashikawa 
et al. (2005) based on apparent magnitudes. While galaxy 
clustering in the fainter bins are adequately described, the 
non-linear clustering seen in the brighter bin is overesti- 
mated. This is clearly due to a wrong choice of parame- 
ters related to the satellite CLF at z ~ 4 for these bright 



1 http://soaps.naoj.org/sdf 
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galaxies. Since our models here assume the best-fit param- 
eters with Mb < —18.5, the over prediction of non-linear 
clustering for galaxies with Mb < — 20 clearly shows that 
galaxies that appear as luminous satellites are only present 
with a higher cut off for M sat . While the Kashikawa et al. 
(2005) measurements only allow one estimate of clustering 
in the non-linear regime, we have begun a separate analysis 
of luminosity dependent clustering from the same imaging 
data as those used in Ouchi et al. (2005). Those measure- 
ments increase the signal-to-noise of non-linear clustering 
estimates as a function of redshift allowing the mass scale 
associated with satellites, as a function of their luminosity, 
be established better when compared to published measure- 
ments from Kashikawa et al. (2005) shown in Figure 13 (right 
panel). We will present these results in an upcoming study 
(Cooray & Ouchi 2005). 

To study the extent to which model parameters related 
to the CLF can be constrained, we now model fit LFs and 
clustering measurements by varying various parameters in 
our model. From these model fits, we establish likelihoods 
to describe the data given model parameters. In this anal- 
ysis, we only make use of published variance measurements 
of both the LF and clustering statistics. It is likely that 
the measurements are affected by a covariance resulting in 
correlations between clustering measurements at different 
physical or angular scales. The presence of a substantial co- 
variance in angular projected correlation function is well 
known due to non-Gaussianities and overlapping window 
functions (e.g., Eisenstein & Zaldarriaga 1999; Cooray & 
Hu 2001). While the model based on CLFs has a large num- 
ber of free parameters (~ 20), various experimentation with 
the data showed that only a handful of parameters are con- 
strained while other parameters remain unconstrained for 
various reasons. Thus, we only consider a subset of parame- 
ters to model fit while other parameters are fixed based on 
various other arguments and observations. For example, we 
fix parameters of the L C (M) relation and do not attempt 
to establish them from galaxy clustering data. As discussed 
in Cooray & Milosavljevic (2005a), such a relation is best 
determined with galaxy-galaxy lensing data and we have re- 
analyzed r-band galaxy-galaxy lensing data from SDSS to 
reestablish the L C (M) relation; The central galaxy mass es- 
timates obtained agrees well with estimates in Mandelbaum 
et al. (2004). In the case of the central galaxy CLF we treat 
(j ccn and M ccn _ cu t as free parameters, while for the satel- 
lite CLF, we take M sat and f3 s as free parameters, with the 
value of 7 fixed at —1 and equation 5 fixed following the 
description below it. For description involving galaxy types, 
we take / con -E, M con , M sat , L sa t, 5 sa t-E, and / sa t-E as free 
parameters. 

While there are 10 free parameters, when model fit- 
ting the data we only consider a smaller subset of these pa- 
rameters for different datasets due to an important reason 
that some statistics are more sensitive to certain parameters 
when compared to others. When considering the LF of the 
total galaxy sample, we fit parameters <T cen , M ce n-cut, M sat , 
and p s , though there are no useful constraints on the latter 
two parameters from the LF. This is clear from Fig. 2, where 
we show that the LF is mostly determined by statistics of 



central galaxies; Another way to explain this is that, at a 
given luminosity, the total density of galaxies is dominated 
by a larger fraction of central galaxies in low mass halos, 
which have a higher density, than satellites of the same lu- 
minosity in more massive halos. 

In the case of LFs of galaxy types, with early- and late- 
type galaxies fitted simultaneously given that parameters 
describing early-type galaxies also describe late-type galax- 
ies, we take a ccn , M con - cut , / ccn -E and M cen as free param- 
eters. Figs. 14 and 15 right panels show constraints on two 
parameters from this parameter set with likelihood of other 
parameters marginalized over. In the case of the total cor- 
relation function, as a function of luminosity from SDSS, 
we fit (J ccn , Mccn -cut, M sat , and f3 s , and show parameter 
constraints on the central galaxy CLF in Figure 14 to be 
compared with constraints for same parameters from the 
total galaxy LF. 

We use the same set of parameters as the ones used to fit 
galaxy type LFs to also fit the correlation functions divided 
to galaxy types from SDSS. The constraints on / CC n-E and 
Mccn are shown in Fig. 15 can be compared with constraints 
for same parameters from the galaxy LF. Since parame- 
ters related to satellite CLF are better described with the 
correlation function, we expanded the parameter space and 
also fitted parameters related to satellite galaxy types. The 
constraints on these parameters, with ones related to cen- 
tral galaxies marginalized over based on LF constraints, are 
shown in Fig. 16. Beyond SDSS, we also consider model fits 
separately to the total clustering data at different redshifts 
separated into galaxy luminosities when available. Here, we 
treat a ce n, M ce n-cut, M sat , and (3 S as free parameters as there 
is no information related to galaxy types in the high redshift 
data except in DEEP2, though we do not use that infor- 
mation explicitly since DEEP2 clustering measurements do 
not probe non-linear clustering in detail. Fig. 17 summarizes 
these results for parameters related to the satellite galaxy 
CLF. 

As shown in Figs. 14 and 15, the total LF and LFs 
galaxy types in SDSS allow better estimates of parameters 
such as (Tccn, the log-normal scatter in the L C (M) relation 
at a given mass, and / con -E the fraction of early-type galax- 
ies that are in halo centers. From the SDSS total LF data 
down to M r < — 17 (from Blanton et al. 2004), <r C cn is con- 
strained to be 0.171q'q2 a t the 68% confidence. In Cooray & 
Milosavljevic (2005b), we found <r C cn ~ 0.22 to describe the 
field-galaxy luminosity function in the K-band (Huang et al. 
2003), while in Cooray (2005a), we suggested a value for the 
dispersion of ~ 0.17 ±0.1 in the 2dFGRS bj band. Unfor- 
tunately, the underlying reason for a difference between the 
dispersion at K-band and lower wavelengths is not under- 
stood. Our estimate for a ccn is in good agreement with the 
value of 0.168 found for the dispersion of central galaxy lu- 
minosities by Yang et al. (2003b) where these authors used 
a completely different parameterization for the CLF based 
on a priori assumed Schechter function shape. When com- 
pared with the Fig. 14 right panel clustering measurements 
do not allow stronger constraints to be placed on these two 
parameters when compared to the constraint based on the 
galaxy LF. This is due to the fact that the correlation func- 
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tion is more strongly sensitive to satellite galaxies rather 
then central galaxies through the non-linear 1-halo term. 

While er con is well determined, we find no evidence for a 
general low-mass cut-off in the central galaxy LF with a 95% 
confidence level on the upper limit of M ccn _ cu t < 3 x 10 10 
Mq. As discussed before this cut-off should not be inter- 
preted as the M m j n parameter in halo occupation number 
models of Zehavi et al. (2004). The cut-off suggested in mod- 
els based on halo occupation number is present in CLFs 
through the L C (M) relation as shown for central galaxy 
CLFs in Fig. 5. We expect a global cut off in the LF if 
effects such as reionization (Benson et al. 2002; Bullock et 
al. 2000; Tully et al. 2002; see, review in Barkana & Loeb 
2001) affect galaxy formation significantly. As discussed in 
Cooray & Cen (2005), the feedback effects may be more com- 
plex than considered before and could depend on the time 
scale of formation relative to the reionization (e.g., Tully 
et al. 2002) and additional heating history of IGM by su- 
pernovae from first galaxies. The galaxy group LFs, down 
to magnitudes below -13, show partial evidence for a cut 
off in the galaxy density corresponding to central galaxies 
at a halo mass around 10 11 Mq with a significant absence 
of dwarf galaxies. On the other hand, dwarf galaxy statis- 
tics in massive clusters, hosted in dark matter halos with 
masses much below the cut-off halo mass in galaxy groups, 
are consistent with the expectation based on the subhalo 
mass function. It is not clear why we do not detect an over- 
all turn over given that such a cut-off is necessary to explain 
the low power-law slope of the fej-band LF of 2dFGRS at 
the faint end (Cooray 2005a) and that galaxies in our sample 
do probe mass scales down to 10 11 Mq. On the other hand, 
the phenomena leading to an absolute lower cut off the halo 
mass hosting galaxies may be local rather then affecting the 
galaxy population as a whole, though this does not explain 
the low-end difference between LFs of SDSS and 2dFGRS. 
In a future study, we plan to analyze the faint-end LF of 
SDSS in detail to address if there is evidence for a global 
cut off. We encourage extending clustering studies of galax- 
ies to fainter magnitudes to obtain a better handle on their 
properties and to extend CLFs down to fainter luminosities 
than possible so far, though due to reasons that clustering 
statistics are not sensitive to central galaxies, it is unlikely 
such measurements alone would be helpful. 

As shown in Fig. 15, the galaxy LF also provides best 
constraints on parameters related to galaxy types that ap- 
pear in halo centers. Marginalizing over other parameters, 
we constrain at 68% confidence level / CO n-E = 0.62 ± 0.19, 
while the mass-scale A/ ccn describing the early-type frac- 
tion of central galaxies is Af CGn = (3.1±|;^) x 10 11 Mq. As 
in Fig. 14, constraints from SDSS galaxy clustering statis- 
tics are lower when compared to constraints from the LF 
for same parameters. While parameters related to central 
galaxies are not well determined by clustering statistics, cer- 
tain parameters related to satellite galaxies are. As shown 
in Fig. 16, while no useful constraint exist for M sa t-type, as 
well as Laat though we do not show its constraint here ex- 
plicitly, / S at-E = 0.5±0.15 while gsat-E = 0.25±0.15 at the 
68% confidence level from SDSS clustering data. 



With clustering statistics, the best constraints are on 
parameters related to the total satellite CLF. In Fig. 17, 
we summarize constraints on parameters j3 B and M sat as a 
function of redshift of the dataset. Surveys such as SDSS and 
COMBO-17 allow these parameters to be determined in de- 
tail. At high redshift, while Subaru data at z ~ 4 from Ouchi 
et al. (2005) allow some constraints, DEEP2 and COMBO- 
17 data only allow an upper limit to be placed on say M sat at 
a given value of /3 S ■ The contours show significant degeneracy 
between these two parameters even in the cases where these 
parameters can be separately measured from each other. For 
example, with SDSS, we find Af sat = (1.2±?;?) x 10 13 h' 1 
Mq with a power-law slope, /3 S , of (0.561° i?) for the to- 
tal luminosity-halo mass relation, both the 68% confidence 
level. The mass limit can be compared to other estimates 
from the literature. For example, based on numerical sim- 
ulations combined with semianalytic models, Zheng et al. 
(2004) finds that even halos of mass io 12 4±01 M host 
satellites with M r < —19. Note that the one-sigma lower 
limit of the allowed range we find from model fitting the data 
is 10 12 Mq. Our results generally applies to galaxies with 
M r < —17. If we concentrate on galaxies with M r < — 19 
only, we again find that the lower limit does not change sig- 
nificantly suggesting that the appearance of satellite galaxies 
with M r < -19 in Zheng et al. (2004; see their Figure 11) 
in halos with mass above io 12 - 4±01 Mq is not contradicted 
by SDSS clustering data. 

At 2 ~ 0.6, COMBO-17 data allows these parame- 
ters for Mb < — 18 galaxies to be constrained as M sat = 
(3.3±3^) x 10 13 h- 1 M and /3„ = (0.62±°; 33 ) at the 68% 
confidence level, respectively, while at z ~ 4, Subaru mea- 
surements constrain these parameters for Mb < —18.5 
galaxies as (4.12±|g°) x 10 12 h' 1 M and (0.55l^ 2 ), re- 
spectively. The large range allowed for Af sat , over an order 
of magnitude in mass at the 68% confidence level, supports 
the suggestion in Zehavi et al. (2004) that halo occupation 
models suggested there are not unique. This large range also 
shows that halo occupation number predicted here and the 
models in Zehavi et al. (2004) are likely to be consistent 
with each other, but given that Zehavi et al. (2004) did not 
present detailed fits to data, a straight forward comparison 
is impossible. 

The degeneracy patterns in Fig. 17, however, suggest 
that a certain combination of (3 S and M sat is better deter- 
mined when compared to these two parameters separately. 
The degeneracy direction is such that as M sat is decreased, 
p s is decreasing as well. Thus, the increase in the total num- 
ber of satellite galaxies, or more appropriately in the con- 
text of CLFs, the satellite luminosity, is compensated by 
the decrease in /3 S so as to conserve the total satellite lumi- 
nosity. To understand this further, we calculate the sample 
averaged total luminosity of satellites, over the luminosity 
distribution of the galaxy sample, as 

(Lsat(z))= (23) 
JdLj dM${L\M,z)^[L tot (M,z)-L c (M,z)] 
JdL$(L,z) 

where L to t(M, z) is given in equation Q. Since Ltot{M, z) is 
a function of parameters f} s and M Ba t, we calculate (L S at(z)) 
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as a function of these two parameters. In Fig. 18, we plot 
contours of constant (L sa t(z)} at redshifts corresponding to 
SDSS and GOODS (at z ~ 3), and, for comparison, we 
also show constraints on this parameter plane from SDSS. 
The comparison reveals that the single parameter best con- 
strained by the combination of /3 S and M sat is (L sat (z)}, 
the sample-averaged total luminosity of satellite galaxies. 
We find a similar behavior with other parameterization of 
Lsat(Af) relation as well. 

In Fig. 19, we plot contours of constant (L S at(z)) at 
the redshift corresponding to Subaru (at z ~ 4), and for 
comparison, in the right panel, contours of constant number 
density of galaxies with Mb < —18.5 at z = 4 (in units 
of 1CT 3 h? Mpc" 3 ) as a function of parameters related to 
the satellite CLF. Just as (L sat (z)) traces the degeneracy 
of the two parameters (5 S and M sat , the same degeneracy is 
traced by contours of n(z = 4) as well. The range allowed 
by constraints on (3 S and M sat is consistent with the value of 
5.8± 1.4 x 1CT 3 h 3 Mpc -3 measured directly in the data by 
Ouchi et al. (2005). As shown in Fig. 19, in fact, the density 
is better constrained by non-linear clustering pattern when 
compared to a direct analysis related to the LF. 

Using (L S at (z)) parameter instead of f3 s and M sat , with 
SDSS, we find (L sat (z < 0.1)) = (2.1±g;l) x 10 10 h~ 2 
L , while with COMBO-17, (L Ba , t (z ~ 0.6)) = (2.4±J;£) x 
10 10 h~ 2 Lq. Moving to higher redshifts, with DEEP2, 
(Lsat (2 ~ 1)) < 3.9 x 10 10 h- 2 L , for GOODS at z ~ 3, 
(L sa ,t(z ~ 3)) < 2 x 10 11 h~ 2 L , and for Subaru at z ~ 4, 
(L aat (z ~ 4)) = (4.2± 2 ' 3 ) x lO 1 " h~ 2 Lq. Based on results 
from SDSS and COMBO-17, if L sat (z) = L sat (z = 0)(l+z) e , 
then we find that e = 0.31 ± 0.52, while between COMBO- 
17 and Subaru at z ~ 4 is e = 0.49 ± 0.74. The differ- 
ence between the two observational wavelength bands be- 
tween SDSS and COMBO-17, r- and B-band respectively, 
and galaxy luminosities in the two samples make this com- 
parison less useful. On the other hand, COMBO-17 sample 
is for galaxies with Mb < — 18 while for Subaru at z ~ 4 
is for galaxies with Mb < —18.5. While there is a small 
difference between the two samples, given the large redshift 
difference, 0.6 and 4 for COMBO-17 and Subaru data re- 
spectively, it is safe to conclude that we find no evidence for 
redshift evolution in the sample-averaged total luminosity of 
satellites. 

This conclusion is in agreement with Yan et al. (2003) 
who compared clustering of galaxies in 2dFGRS at low red- 
shifts and in DEEP2 and suggested no evidence for evolu- 
tion between now and a redshift of unity in the total CLF 
as parameterized by Yang et al. (2003b). Either confirming 
or refuting the redshift evolution could help in understand- 
ing how satellite galaxies merge within halos to form central 
galaxies, whose luminosities do evolve with redshift. Given 
that clustering measurements by Coil et al. (2004) involved 
only a subset of the final DEEP2 galaxy sample, the com- 
plete analysis should improve the estimate of (L sa t) at a red- 
shift of 1, which when combined with SDSS and COMBO- 
17, should improve an estimate on the redshift evolution of 
(Lsat) compared to the estimate here. 

In Figure 20, we present the comparison between con- 
straints on /3 S and M sat parameters from SDSS and Subaru 



and COMBO-17 and Subaru respectively. While constraints 
on Z/ sa t(z) show no evidence for evolution, at the la confi- 
dence level, we find that (3 S and M sat parameters at z = 0.6 
and z — 4 differ each other suggesting that these parameters 
in fact show some evolution. The fact that these parameters 
show differences (as in Figure 20), while a parameter such 
as (L sat ) remain the same may, at the first instance, contra- 
dictory. The difference in parameters such as /3 S and M sat 
between low and high redshifts comes from the difference 
in halo mass functions between redshifts. As the halo mass 
function evolves, there are no high mass halos, and satellites, 
if exist, should be appearing at a lower mass halo. This is 
the general trend we see in Figure 20. If that's the case, one 
could argue that (I/ sa t) should decrease as a function of in- 
creasing redshift. We do not find this behavior as halos at a 
high redshift gets assigned brighter central galaxies than at a 
low redshift due to the redshift evolution in the L C (M) rela- 
tion. This anti-hierarchical behavior is consistent with what 
is generally referred to in the literature as "down-sizing" or 
mass-dependent luminosity evolution where brighter galax- 
ies form first than less luminous galaxies. Since small halos 
are assigned brighter central galaxies at high redshift, given 
our description of the CLF, it is natural that such halos end 
up with brighter satellites as well, relative to a same mass 
halo at a lower redshift. Thus, while (3 S and M sat change with 
redshift, (L sa t) remains the same. Note that in our models 
of the CLF, we have not a priori assumed this behavior. In 
fact, CLF parameters may take any value, and we only note 
this behavior because of the model fitting to the data. Thus, 
our model fit results provide support for apparent brighten- 
ing of galaxies at high redshifts both in the case of central 
galaxies, as discussed in Cooray 2002b, and satellites, as dis- 
cussed here in terms of the clustering statistics. 

In Figure 21, we show constraints on M aa t and j3 s as a 
function of the galaxy luminosity. For clarity, we only plot 
constraints when -19 < M r < -18 and -22 < M r < -21. 
These constraints reveal, though not significant, a trend in 
la constraint on M sa t as a function of the luminosity bin 
such that as galaxy luminosities are increased, M sa t is also 
increased. Such an increase is heavily favored in halo occu- 
pation model fits of Zehavi et al. (2004), though, we find 
that large uncertainties in our model parameters related to 
CLFs do not allow us to establish the same dependence of 
M m in, the minimum mass for the appearance of a central 
galaxy in Zehavi et al. (2004), with galaxy luminosity here 
for the appearance of satellites through our model param- 
eter M sat as a function of luminosity. As stated in Zehavi 
et al. (2004), the halo occupation models shown there are 
not unique and it could be that the largely increasing val- 
ues of M m i n as a function of galaxy luminosity is partly 
accounted through unusually large power-law slopes in the 
halo occupation number models suggested there. It is likely 
that this result can be further improved with galaxy-galaxy 
lensing studies, and as we discuss later, more likely with 
cross-clustering between faint and bright galaxies. 

Instead of using galaxy clustering data to establish the 
mass scale at which satellite galaxies begins to appear, as a 
function of the luminosity, one can establish the same rela- 
tion directly from the data if galaxy sample can be divided 
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to a distribution of galaxy groups and clusters, with some 
mechanism to estimate the halo mass of that halo. Following 
this approach, we make use of a suggested catalog of galaxy 
groups in SDSS by Weinmann et al. (2005) and use the lu- 
minosity distribution to study the minimum luminosity of 
galaxies in these halos as a function of the halo mass. In Fig- 
ure 22, we summarize our results where we consider close to 
~ 10 4 groups and clusters in SDSS. The halo masses are 
estimated based on the total luminosity of the halo, though 
due to small number of galaxies when halo masses are be- 
low a few times 10 1 2 Mq, the mass estimates may become 
highly uncertain. The catalog may also be affected by un- 
certain galaxy assignments, especially when a galaxy that 
is parter of a large mass halo such as cluster gets assigned 
systematically to a lower mass galaxy group. Ignoring these 
complications, which affect the low mass end, we see a trend 
in minimum luminosity with halo mass. This trend can be 
roughly described as M r t{Min) « -19.8(M/10 12 M ) 03 . 
Galaxies with luminosities greater than -21 only begin to ap- 
pear in dark matter halos with mass above 10 13 Mq, while 
the mass limit for galaxies with luminosities brighter than 
-17 in the r'-band is below 10 12 Mq. 

The result we derived earlier where we suggest that 
mass limit is (1.2t 2 'i) X 10 13 Mq is for the whole sample and 
is generally weighted by galaxies in magnitude bins between 
-19 to -21 (see, Figure 8 for example). The result based on 
clustering analysis is thus generally consistent with the di- 
rect estimate from the cluster catalog. In Figure 23, we plot 
the constraints on minimum halo mass and the power-law 
slope for individual bins in luminosity between -18 and -22 
(dashed lines). If we make use of the general result that 
M r ,(Min) ^ -19.8(M/10 12 Mq) ' 03 , then we find that 
/3 S > 0.4 at the 3er confidence level. Returning to Figure 21, 
we then see the clear trend between minimum luminosity 
and the halo mass even based on galaxy clustering. 

As a further application of our results, our CLFs can 
be easily used to estimate the average fraction of satellite 
galaxies in dark matter halos over a given luminosity range, 
</ S at(L)>: 



(/sat(L)> = 



JdM$ sat (L|M,z)^ 



(24) 



In Fig. 23, we show contours of constant (/ sat (L)) for several 
luminosity bins between -18 and -22 in M r as appropriate 
for SDSS as a function of j3 s and M sat . For reference, we also 
plot the constraint from SDSS clustering data on these two 
parameters as a function of the luminosity bin. 

To estimate the satellite fraction as a function of the 
luminosity bin, instead of M sat and f3 s as parameters, we 
determine the likelihood for the single parameter (/ aat (L)} 
directly from clustering data. These results are summarized 
in Figure 24. The satellite fraction is in each of the bins 
is 0.13±°;°3. 0.11±°;°2, O.llljg, and 0.12l°;g 3 for galaxies 
with r'-band luminosities of -22 to -21, -20 to -21, -19 to 
-20, and -18 to -19, respectively (see, also, Figure 22). As we 
discussed with respect to Figure 21, there is an indication 
that (5 > 0.4 to be consistent with the minimum luminosity 
of galaxies as a function of the halo mass. Thus, if (3 a > 0.4, 
the satellite fractions are 0.1051^, 0.12±°;°|, 0.13^;^, 



and 0.131°,;^, for galaxies in luminosity bins of -21 to -22, 
-20 to -21, -19 to -20, and -18 to -19, respectively. 

Though our fractions are slightly lower, considering the 
errors, these fractions are fully consistent with the values 
suggested in Mandelbaum et al. (2004) in three luminosity 
bins based on an analysis of galaxy-galaxy lensing data with 
numerical simulations. Given that current data allow a large 
range of satellite fractions, for most practical purposes, one 
can assume that the satellite fraction is a constant with a 
value around 0.1 to 0.2 in luminosity bins between -18 and 
-21 in M r for general prediction calculations (e.g., Slosar et 
al. 2005), though when estimating cosmological parameters 
or other parameter constraints, it may be best to take into 
account suggested variations. Unlike calculations in Man- 
delbaum et al. (2004) or Slosar et al. (2005), in the present 
description of galaxy statistics with CLFs, satellite fraction 
is not an independent free parameter and is only determined 
to the extent that parameters related to the satellite CLF 
are known. Thus, we need not establish the satellite fraction 
separately. 

Involved with the above expression for {L sett (z)), in 
equation l24l is the probability of halo mass to a host a galaxy 
with luminosity L at a redshift z given by 



P(M\L,z)dM 



$(L|M, z) dn(z) 



dM 



dM . 



(25) 



These probabilities are shown in left and right panels of 
Fig. 25 for a faint and a bright sample of galaxies at 
three different redshifts, respectively. The two panels, when 
combined, show the mass-dependent redshift evolution of 
the galaxy luminosity following Cooray (2005b). Luminous 
galaxies at high redshifts are found at lower mass halos 
than dark matter halo masses that corresponds to the same 
galaxy luminosity today. At the faint-end, —18 > M r > 
— 19, regardless of the redshift, faint galaxies are essentially 
found in dark matter halos with a factor of 2 less range in 
mass, though at low redshifts, a 30% or more fraction of 
low-luminous galaxies could be satellites in more massive 
halos. 

In Fig. 26, we show the same probabilities divided into 
three magnitude bins as a function of redshift in four panels. 
When —22 < M r < —21, at z ~ 3.5, galaxies are primarily 
in dark matter halos of mass ~ 3 x 10 12 Mq. In compari- 
son, such galaxies are central galaxies in groups and clusters 
today with masses above 10 14 Mq. The same probabilities 
have been estimated based on galaxy-galaxy weak lensing 
studies in SDSS by Mandelbaum et al. (2004). The mean 
mass estimates, at a given luminosity bin, and the dispersion 
of the mean mass based on probabilities shown in Fig. 26 are 
in agreement with estimates by Mandelbaum et al. (2004). 
For example, probabilities shown in Fig. 26 suggest that the 
mean halo mass for the bin -21 to -22 in M r is about 8 x 10 12 
h _1 Mq which agrees with the mass estimate of 9.71 x 10 12 
h _1 Mq based on NFW fits to galaxy-galaxy lensing data. 
Since galaxy-galaxy lensing measurements trace the galaxy- 
dark matter correlation function while our estimates are 
based solely on galaxy-galaxy clustering, these agreements 
suggest that to the extent probed by current data galaxy dis- 
tribution traces dark matter as assumed in these halo-based 
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models. We will, however, test this assumption in detail in 
an upcoming analysis. 

At high redshifts, the halo masses are again consistent 
with various prior estimates. For example, in Conroy et al. 
(2004), the dark matter halo masses of —22 < M B < —21 
galaxies are measured based on velocity profiles with a halo 
mass estimate of 5.5^2 x 10 12 h _1 Mq. Our probability 
distribution function for halo mass in this luminosity bin 
and at a redshift of unity suggests a mean halo mass of 
5 x 10 12 h _1 Mq in good agreement with this result. The 
agreement of halo masses based on galaxy LFs and prior 
estimates based on clustering etc at higher redshifts, in the 
context of LBGs, is discussed in Cooray (2005b). 

While certain parameters related to the satellite CLF 
are constrained well by current clustering data at low red- 
shifts, parameters related to satellite galaxy types are not. 
The measurements by Zehavi et al. (2004) involve cluster- 
ing of galaxies in the same luminosity bin, as well as the 
clustering of galaxies in the same luminosity bin and the 
same type (except in Figure 7, the cross-clustering between 
early- and late-type galaxies for M r < —21). These mea- 
surements, while useful, do not provide all the information 
related to clustering for the same sample of galaxies. For 
example, to probe the CLF of satellites better one can con- 
sider cross-correlating galaxies that do not have a significant 
overlap in halo mass in terms of the central galaxy CLF. The 
possibility exists when considering a faint and a bright sub- 
sample of galaxies. As shown in Fig. 5, the central galaxy 
CLF for galaxies with —18 < M r < —17 peaks at a halo 
mass of few times 10 11 Mq. The same CLF peaks at a halo 
mass of few times 10 15 Mq when galaxies with luminosities 
-23 < M r < -22 are considered. While the CLF of cen- 
tral galaxies do not overlap, resulting in no contribution to 
the cross-correlation between these two samples, there is a 
certain overlap in the satellite CLF and to a lower extent 
between the central galaxy CLF of the brighter sample and 
the satellite CLF of the fainter sample. 

The cross-correlation of galaxies between these two lu- 
minosity bins will provide an additional handle on the lu- 
minosity distribution of satellites in clusters of galaxies. In 
fact, one can consider cross-correlations between different 
luminosity bins as well as different galaxy types; For ex- 
ample, the cross-correlation between early type galaxies in 
the fainter sample and late-type galaxies in the brighter 
sample. We illustrate the expected cross-correlation between 
-18 < M r < -17 and -23 < M r < -22 galaxies in Fig. 26. 
For reference, in the same plot, we also show the galaxy clus- 
tering correlation function of galaxies measured by SDSS in 
each of the two luminosity bins. While the cross-correlation 
has not been measured in the data yet, we propose these 
additional measurements for the whole sample. Such mea- 
surements, in addition to clustering at each luminosity bin, 
would provide all the information related to galaxy clus- 
tering at the two-point level from any given survey. This 
information could in return help further constrain CLF of 
satellite galaxies as well as the fraction of galaxy types in 
the form of satellites. 

While we have concentrated primarily on the use of 
galaxy LF and clustering measurements to constrain param- 



eters related to central and satellite galaxy CLFs here, a pri- 
mary interest of these statistical measurements is to estab- 
lish global cosmological parameters. This has been achieved 
mostly by combining CMB data, such as from WMAP, with 
large-scale linear clustering with surveys such as SDSS (e.g., 
Tegmark et al. 2004) or with non-linear clustering part mod- 
eled based on a simple parameterization of the halo occu- 
pation number ( Abazajian et al. 2005) . The latter approach 
can be done with clustering measurements at different red- 
shifts and the combination, as a function of redshift, would 
provide additional constraints on the growth evolution of 
density perturbations. The CLF approach suggested here 
may make this a possibility since CLFs provide estimates of 
galaxy bias, both as a function of luminosity and redshift, 
once the galaxy sample used for clustering measurements at 
various redshifts is well defined. While we have not consid- 
ered cosmological parameters measurements here this is of 
significant interest and we hope to return to this once sev- 
eral high-redshift surveys provide more accurate clustering 
measurements for a well defined sample of galaxies. 



5 SUMMARY AND CONCLUSIONS 

To summarize our discussion involving model descriptions 
of the galaxy LF and clustering statistics with CLFs and 
estimates of CLF parameters directly from the data, our 
main results are: 

(1) Instead of the halo occupation number, it may be 
useful to describe galaxy properties through the CLF when 
describing the galaxy LF and clustering statistics. As dis- 
cussed in Section 2, CLFs provide a consistent way to com- 
pare, and understand, differences in measurements between 
different samples conditioned in terms of galaxy properties. 
While occupation numbers have allowed model fits to clus- 
tering statistics, their use is restricted to clustering statis- 
tics alone as LFs cannot easily be described by occupation 
statistics that treat all galaxies the same. 

(2) We have outlined a general procedure to describe 
CLFs of central and satellite galaxies by improving prior 
descriptions of CLFs by a priori assumed Schechter func- 
tion shapes (e.g., Yang et al. 2003b, 2005). The log-normal 
distribution for central galaxies and the power-law assump- 
tion for satellites combine to produce an overall Schechter 
function shape for galaxy LF (e.g., Cooray & Milosavljevic 
2005b), but at the same time, also explain why the cluster 
LF (e.g., Trentham & Tully 2002) cannot be explained with 
a single Schechter function. 

(3) The combination of SDSS LF and clustering data at 
low redshifts and clustering measurements at high redshifts 
allow certain model parameters related to central and satel- 
lite galaxy CLFs be determined from the data. For example, 
the appearance of satellites with luminosities M r < —17 at 
z < 0.1, using a total luminosity-halo mass relation of the 
form L c (M)(M/M Ba , t )s, is constrained with SDSS to be at a 
halo mass of M sat = (1.2±™) x 10 13 hT x Mq with a power- 
law slope p s of (0.56l 017 ) at the 68% confidence level. At 
z ~ 0.6, COMBO-17 data allows these parameters for galax- 
ies with M B < -18 to be constrained as (3.3tg;o) x 10 13 
h^ 1 Mq with a power-law slope of (0.62lg; 33 ) at the 68% 
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confidence level, while at higher redshifts, Subaru measure- 
ments constrain these parameters for Mb < —18.5 galaxies 
as (4.12l| °) x 10 12 h' 1 M and (O.55+0 35), respectively 
at z = 4. DEEP2 and GOODS measurements only allow an 
upper limit on the power-law slope of total luminosity at a 
given minimum halo mass for the appearance of satellites. 

(4) The single parameter well constrained by clustering 
measurements is the average of total satellite galaxy lumi- 
nosity corresponding to the dark matter halo distribution 
probed by the galaxy sample. This parameter traces the de- 
generacy between M sa t, the minimum halo mass in which 
satellites appear, and (3 S . For SDSS, (L sat ) = (2.1±g;|) x 10 10 
h -2 L , while for GOODS at z ~ 3, (L S at) < 2 x 10 11 h -2 
Lq. While current data do not suggest any redshift varia- 
tion in (Lsat), consistent with a prior suggestion (Yan et al. 
2003) that CLFs do not evolve in redshift, at the la con- 
fidence level, we note that parameters related to satellite 
CLFs do change between z ~ 0.6 and z = 4. Such a dif- 
ference is expected given the redshift evolution of the halo 
mass function and the difference in parameters are such that 
the halo masses where brighter satellites appear at high red- 
shifts host fainter satellites at low redshifts. Parameters such 
as the fraction of satellites at a given luminosity are not well 
determined by the data. Such parameters are built into the 
CLF description and does not need to be specified separately 
as in the halo models of Mandelbaum et al. (2004). 

(5) In addition to constraints on central and satellite 
CLFs, we also determine model parameters of the analyti- 
cal relations that describe the fraction of early- and late- type 
galaxies in dark matter halos. We use our CLFs to establish 
probability distribution of halo mass in which galaxies of a 
given luminosity could be found either at halo centers or 
as satellites and find good agreement with prior estimates 
based on an analysis of galaxy-galaxy lensing and direct 
mass estimates based on velocity profiles, among others. 

(6) Finally, to help establish further properties of the 
galaxy distribution, we propose the measurement of cross- 
clustering between galaxies divided into different luminosity 
bins. 
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